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4 7 V h W ^ T^ff -f Sfc»^-^7y ^Wf^^ 

[»*5i3l wiasrfeds, ^*#»Wt y 

tW*Jl4] ^^^M|E#^\ *J*B«<0*:«> 

t «r»« t -T S if 8 tc !B»tf> a v tf ^ - * «E» o w 

BJHfc. 

t i- 6 w** i \mm<r> = > t: e A - * her o wf mm 

i & £ 1* 5 !Ht#S s tcfaife^o =« >- tf ^ — 9 MM V "Jf 
BMEtt. 

t 6 9 ic|B«t<o > tr ^ - * ssjr «y «rte« 
[B*9[l i ] *K£*Lfci*fc:, 

3 >- b g a - # lcffi*8 £ 5 ^ > fc* ^ - £ SSB* n *T 



(2) 2003-131772 

[IS^jfii 4] ^ 7^ry b/f— b!7— ^-c 
fc ^ 7 -y r >- b ^±^Atff— #mo~7j k 

.20 

1 4 tciafeco^fc 

6 ^ 7 -f — jv KM Hig#St 5 x x y $ ib tc*tf 
tit* 1 !! 7] ^ 

9 ^ 7 s h 7 s ^ * X*mTf-t&tc#><D--? — ? 7 y 

— *7 */-?m*K »l»ffl<0jRtt4fctt*»*r*"+*^ 

8] gJISg*^, *^BNt, 9»«BM, 

l^ofe^^fcOT^^^<b%#m^t-^RS^l 7ic 
^ tr » - # i^H 9 ^ tt!*«f*o 

tfZ. t tt5»*fl 1 7 MffisE^ =• > f - ^ is 
^ IR9 RfUBJItflE. 

[1^2 0] MUEJHU^s BIK&ir^^^-h** 

50 ^ft^^yea-^^tPlilMo-C, flOsE"^ 



(3) 

3 

{IS*3l 2 3 ] gJjfS^- 9Ty /IW^ HTML, 
XHTML, c HTML, XML. &cfctFWML<9 5 t> 
1 oSr-g-tf d 4: 4r«Wi:i-5|»*B 2 2 fdgflft^ 

IB® © = >- e a - * Wft o pT^K^c 
I o o o i ] 

[0 0 0 2] 

■r (PIM) ftifco/hSl^^fjx— x-f ^x^-Yj*, 
KB, *JJ:t«fl*WBii. A^HttSBtt*-?*'*-* 

I o o o 3 ] c^i^^yt^-f^y^rVw^^ 
< , zi^t*^— 7^ >^^v<^^^/%r>^->^jt-ei^-ij 

[0 0 0 4 ) T^T-ft, VoiceXML (voice 
extensible markup Iangua 
ge) Sr*«-t-5Jt: 5**^tK— ^/WS. SESS^^SMfc 

m\^T<<>&-~*y b ^ >?■ >y\zr p ± ^x$ Z> £ ? 

>" hi^- /< Cfci Web 1*— /<) v o i c e 

XMl^y^/j) ^£^LT? 7-< T> 

$:Mt5 ff We bfw^itiC^lTVo ice 50 



#111 2003-131772 

xml K^^y^h^Mtsr k&pJMxhV, zti 
ft, tmi^L^^ K^ffiffl tt, Web 

t0 0 0 5] 

[»W3ft««HfcLJ:5A:r*il«] VoiceXMLii, 
f»J»li> y ^f&itftfrlSt;*^ y 

Ir^HTML (Hyper Text Markup 
Language) yu — »J»^7*/M£«£ 5 t> OTil 

tfU6. vo i c e xml ft— mz, m^wm^ 

^BB«^ftffl"C#Sr^y ^-v-g >-ttl s Vo i c 
eXML*r»aMa^5&tfJCtt, §M&#29«, Vo i ceX 

[0 0 0 6] Ifc^ot, -f>-^-^^ h^^OI^— 

co^^^io, ^^v^tl-^r^^^-t^w 
e b^--* t- y fcor- **r? T^rteX U 6 **^*^* 

[o o o 7 J 

^T^WCIt ^7^7yh7^^^LTA* 

[0 0 0 8] d OlEKSSi (extens i on) s 
Tv^xtt. A^f^^§(??7-f-;i/K^t5We b 

I 0. 0 0 9 I WHS*— f^^^T-T^^MC, A 
^70> f7"^<-r ^JfcttWe b 1^ — 
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5 

[0 0 10] 

(PIM, PDA* if) ©»»«*3 QlCTF&thX^ 

^y^q^ AAtf^flfcif© 

[ooiij ^awM^** * 3 o (ogy^^n 

co IIP] ^ffifi £ SStRft ^ -t h ft tf> , & £ V SS toftii 
<7>#&T* * -v * f3:#^# <f^^y KfllWS: 

[0012] mm 2 Sr#BR-r« i , »f>W * 3 0 

(CPU) 5 0 its h^atTfBHBWMBS:* 
JfrfSc CPUS O^fV 3 4^^tlt^ 



(4) WI2 0 0 3 - 1 3 I772 

c p u 5 o t jowwci** ShfcW /»& 

^V^^r^ir^^^&y (RAM) 54f£ v CPU50 
^^^y (ROM) 5 8te*Mft**ia. ROM 5 8 

10 0 131 RAM54(t T7'!)^3^n^7 

20 [0014] C PU 5 0 

^£>. 9*— ^^E^^^n— F1" £fc#>l^ sj-y^ 
si ^Oagft^ — ^^-^ 6 0 £ffi«rr £ - t fcT*^ 

[001 5 ] SKl-r^^^ ^30(1 -7^^n^^>"29 
JO fe^Utr^n^^^ (A/D) SHft»37, /^e>t^ 

A (l 1 ^, DTMF S ^x^ff, ^fc{^^^ 

tLT, W^7^>2 9WIItBiU m 
dSA/DSE*»3 7tJ;o-C^/HKSlx5 0 

5) (Cg^^^fc^tC^ft^W^ 3 OtdpSH, We 
b-y— /<2 0 2 4o£l>W^irV^ 730^^7^7>F 

50 ^Mti-s^fc* 4 ^*** ^x-^^Pfl^, 



7 

BIRa***-?^ * 3 O^J;U?/^fclWe biJ— s<2 

0 2©JV>4<Hio|:lt:HSTl6 B Jlfli 

E o o i 6 ] B 3 tf\ SSttme 8 o ^M^ttntK^Bfr 

[0 0 17] ft*£DM^^fc^#l|!)^^b: e ^— ^-^ 

^7tmi^^0i]^fi, ii&omis <bb&l) . ;<-y 

# S/ ^ ^ , 7°o ^ 7 AT5JSg^*^»«;^.S{fp. * * f ? 

1 0 0 1 9] &{c v H4 fC^Lfc^il^Vt^--^ i 2 
2 0 t>^®^=J y^v-r^ v^S^O- $Hc*§^-f s 

m <r>&m & & »«ib o n h l t h s> t>*<n m m 

[0 0 2 0] ^yt^^-^iaoT^^ 
n ? a ^ ^ — /Mc fi % ^^co * * ? £ i^f f 5 1 > 



(5) 4#M 2003-131772 

iM~& & i^f^Sfliftfc^ft B*43j:t«IB*:, 
[0021] B4Sr#JBi-5£, =»y^~^i20fi? 

SSifilans/ h 1 4 0 \C$&&-t%> Z/^J*s<* 1 4 

Sr^f^t, ^5lfc7-^r^ft(CEl Indus 
try Standard Architecture 
( J S A) s<X .Universal Serial 
Bus (USB) v Micro Channel Ar 
chi tecture (MCA) ££31 ISA (E 

ISA) -^J*, Video Electronics 
30 Standards Association (VES 
A) ct— *Jj:Uf^*'=>-^^i: t>(*tfn-5P 

eripheral Component Inter 
connect (PCI) /^tfiiilS^, Cl^^lC 
PS^^nS t>0"Cj4<Ct^ ^>-t B rL — ^ 1 2 OH 

-^lSlt^*{Cfi. RAM, ROM, EE PROM, 7 

7 s/^^^-^y ^-yjjtB, cd-kom, 

v"^^ MK.mT'^ 7^ (DVD) S^iliSfeoojt^-f 
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9 

[0 0 2 2] SftSGfm. ^EjMi*, HHSat£«fc»tt 
©K8^ * = XAft if «**«^? fltWT, 

[0 0 2 3] ^^A^^eU 1 5 otcil, W&L*)M-m* 

(ROM) l5 1|3j;tj«7^A7n^^y 
{RAM) 15 2W© S »»tt*5i:V/*fcB:^IW8 

IWft^^t: 0 ^--^ l 2 ort(7?SI*IW"e^fJf$|iteai 

3 (BIOS) ti„ AattiCftROM 1 5 liCftjlftS 
;h,£ 0 RAM SPH#r^ir^Wi^ 

[0 0 2 4] =«>fcfzt — ^ 1 2 Oil, ftt^ft^Upfte/ 

»^ L?T=*, iwwt/^iisstfe^ => >- if =«. - * eww* 

^ K^-T^l 7 l s ftbW-CD ROMifeliftO* 

j£-f^ftv\ p|/Tifi4»(m*^ttfflt5 r t <dt*# 5 

■r— ^ yy^K^r-hRAM, yy>KxT— hR 
ft^\ /^KfV^^ K7^7"l 6 1ft, l&^l&Kit. 



(6) WHS 2003-131772 

~7x^^U^7^ 14 1 t^fiS&fts 

I 4 1 tC^^iX^ 0 

[0 0 2 5] ±|E"C#^UH4^^LfcKy^y^J:O f 

^16 1^t/-f^^^f A16 4, TT*^ 
186, &£Tfzfv?J A?-^ 1 6 7fe*lftl,T^<5 

15 5, ffi£?)/P^7A^^a--;H 5 6 s iJitf/o 
^7Af-^ 1 5 7iRi:"e*)5i*t*4S»*t^ 

20 y/n^7Al 6 5. ft^o^A^a^H 6 

6. ^i^P^/7Af^l6 7int ^ft<<t4>JR 

ft 5 =a r 1 £^l-fc*fc»ft5»*a*#;t& 

[0 0 2 6] au— Tfi, K182, ^^P7 

^120^At6wi^t*S5. teOA^fVM^ 

y h l 4 0tc»EsK*^»*^*v^, ^9 t^tf— ' 

K K £fcl*U niversal S e r 

ial Bus (USB) *WtliWy^-7x^ 

t"tM — x l 8 5te£<D4 — y ^ — 

■frLXisx"rJ*/<x l 4 1 teSMfc^itS,, a^ejL — ^ 
^t? S3, ^^U)PX.T, Ml^y^-7x-Xl 8 8 
^ITglf^r 4:^5 7; t"— # 1 8 7*JJ:^ 
y >^ 1 8 6ft^ffi^SI±l73^^^fc^tfw t*s 

[ 0 0 2 7] — # 1 2 ptt, y hV^ 

— ^ 1 9 4ft<Jf<0 i o^fcfllS^:^ D h ^v^» 



11 

— $ (LAN) 1 9 1 J) T*y 

— ^ (WAN) 19 3^ft^W, i^*yh!7 

[0 0 2 8] LAN*yM7-^y^»-C«[H4h5 
T^:/^ 190 Sr^tTLAN 191 k:^ 

h^ifOTAN 1 9 3^^LTil 
«**ft^5fc»0»<&*ft**t>^ 9 2**1*3 

:x — * 1 8 O^/cfli^ili^LT^ri./^ 

xi 4 i fcftgt-r-sr. h 57— ^aiasTf 
tt, =»>-tf^— ^ 1 2 o tcBaauT^stifcT'ci^y a 

(0 0 2 9) B5li, **W-e*lfffrie«^ We b-<- 
^IM^T-^rf^t 2 0 0 ^^tit^6 9 -|R 
Kl. We b^-/<2 0 2lt»(flSiXfc«f*{Ctt, ^tbx 

3ft We bf~/<2 0 2i>>£><Dmn&, 

1 0 0 3 0] *e>^Mttrttc 7»^f^f^2o 
0 ottJ:<*afeixfc^— j^r (fctiSHTM 

L s XHTML, c HTML, XML, WML^W <7> 
«»C*feix5J:<»feixyJ:GU I M4rf^LT7n 



(7) *5fBB 2003-131772 

[0 0 3 1 ] — ^3011 Web 2 
0 2«tf^HTML^^ U y b&^4rHfr 

15* co j; p [c^-r ^ 3 0 tc J: o T pTttA^^ttMS 

SSfclitS^T^ffi^^li, IBIBf— ^2 0 4ic 
2 0 4 <7>#llto# < co^lfiSrBfe 

-"X^^Lrlf^^^^/^-r/v-l-^ ir, r>M^3 0 
[0 0 3 2] HI 5 <fc 5^ T^-l'^ 3 0, W 

/<2 0 2Wt-A2 0 4 4r*tr&»i*i\, 

^mtffi^-^2 0 4oiij«3icrt#4:*D6 1 ^ifft< , 

^ M7-^ 2 0 5lca**-*r. ir^nj^teD. -e^tt 
iot, We b*^-/<2 0 2-e«*S*x6.EKftL^, 

We b*^-/<2 0 »(^7^7y hifjO 

^0 tt, We b^-/^2 0 2, %%mV-^2 0 4, 

it^^t^T^5 0 h^ffi^v 

H«t-^2 0 4 *:*frtf^ kF>V£ 
BfH-CfcHtf, Webt-^2 0 2J6i:a 

[0 0 3 3] &%£m<r>— mWii-Zs P 74 T > V*f'<<< * 
50 §IH5^^rs^-^T 
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IB 

y &G1rZ> = >'X=.~jrm'&9 *f mure & o 
[0 0 3 4 J ®g£8 0 Sr^hLfoWe bfw<2 0 2-><7) 

!7-^ 2 0 8^<&jftRj&s-£**L, r^BB8 0trB= 

fy-F^x^2 10[CSMt^ y-h^x^210 
Jl> miS 8 0«rtl»^9 £f* 2 1 2te*JBEi-6o « 
K*)»y^^1f2 1 2fctt\ «B-f 7*— ^»J; 
tf#^^Cif 2 1 6Sr«tti-5^f -<T^w<2 1 4 

a*Sr*ii*. ^w*3 0£PS«k:, «b#j*:/ 9 917 s 

✓<2 0 2*"b**ta*. Skl^SSft-^s HTML 

-Cfl, W e b f 2 0 2 ^-fV^ 7 3 0 J:^!!! 8 
U I ?^7^ h t»*fcf ht$ifiMCbftv^ tr 

ea*n*"srBffl"*3&^b<?3#)*B«*Ss h?-* 2 

0 5£fcill|fflim3ft2 0 7©v^i"ixA**r^UX, fc^x. 

h:tcp/] p^jilt, *j»>^?^if2 1 6 3&*e>B 

it^2 0 4i«$M o Webt^2 0 2, U 
lt^2 0 4, :fo£tmt£1f^y 9 If 2 1 2te s [1 
4 ^ S ^fciFL/B^ * * h * V feTs. - * ft soffit 

^ =« >- a - x ^ > '/mm-vMMi- %z.t&x% 

[ 0 0 3 5] fc*:U DTMF««ffl^^«^ 
C<^»B^B«tt-«t-, H»t^2 0 4 T**ft< ^ 
rV7fw<2 1 4^^tf^H5^^^M#^^cV\ 
*V DTMFtM^f^7f-/<(aoi: 

10 0 3 6] i^oj:^^ 

h/*— ^<T — ^^^^tcBBtl^ai-SIWWds 
J:W/*fc«^v?*^ Hr*ft5fc»^ HTML. 
XHTML, cHTML XML, WML&ifff)^ — ^ 
Ty^WBWttBlBirfttrJK $fcllffi«^fikG0SGM 
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/4 

"CSS. 

ro 0 3 7] saiw^Ji^/^fc^yi?^^ h 
tts BB««^#ja> BB*«<z>*fi\ jsj^/s/tfi 

«*HW!>fc»iD, BBSSB^MIWdsit^/Sfctt*^^ 
^ hi, iMfi«^ J:^^ y / ^ «?)B4 ©fc 

— ^©B^tBff^-^r^^WB^iBJni-*, ba 

20 -/<2 0 4T?«fflS*b*B»*5J:U t *^'fr*7 p 7y h^ 

[0 0 3 s J BBl^ffaftB^feiV/tfett*^^* 
^ h«r*-r§^-^ry^flrB*cov*xat^5«IK^ * 

- KWSRSrWe bif— ^fcC»«i-£ - £ Sr 
-a&\ IWGUHy^7x^^^T^6o 

a v Mas terCard, ^fcf^Am e r i can 
Express &t\ ffifflStt^^ U"^?/ hi?- K«D£ 
^ytrA^^^fc^co^-<— ^K2 5 0^^tv^ o » 
2(07-f^F2 5 2{£^ u-t?5/ h^— K«**rA^-t" 
S r td^mxfo y> s m 3 <r>y a -fr K 2 5 4 ii#Sb« 

P%A^i-^>- t^pJ^r^S. 7^f^K2 5 0, 2 
5 2, *?J:U t 2 5 4(CA^)^nfcif#^^i-^fc^ 
^c s f-y 5 y h ( s u b m i t ) 4^y26 4 dS««S 

40 [0 0 3 9] H 7(1, 9 =74T^ bfrhWli&<D2 

^®i"6J: i 5}c, =-Kteii*s:»»2 6 o*5A^^ 

y -f ht$ft2 6 2^$tl6o ttM2 6 0M, * 

^-;i/K2 5 0 % 2 5 2. fej:l^2 S 4 ^r^1-ffilS?T<0 
^— K. ^ibl>nci^y^ y h^^>2 6 4 <d Ki£* 
^tv^ (IH6). r*n«T?H, -Y-O-^r^V^ (eve 
n t i n g) IfzK' — HiiitfBftaSnfc^^ V?h& 



75 

ffim*f%t, It ^ver i f yj ftWXPmtStlZ, * 

±.\**? yy hit ft 2 6 2T^ff^^5o r v e r 

ifyjfc W — MVisa.Mast 

e r C a r d , ^^Am e r i c a n Expres 

E 0 0 4 0 ] B9 8 tt s *!MStt4:ttJa ITWebf-A 
2 0 4 te*ft£tifc£ W*s> — Kflf**rBk»i-6fc 
#>^, g]5 tDGU I ^l^li:t)0^t^^7^7V h 

[0041] —m^ trains (-^c r^Vj tfcupa 

{c|LT«ffitacH^T'|^^^y h*5j:iM y* F 
TML, XHTML, c HTML, XML, WMLrt 

btiz>* ^wm^^^xmi-rnxmrn^m c 6 

[0 0 4 2] rrx\ ^T^^ds^ffStts 

)WRt 5 #ac^tcfcif$tifcv\ Si(?)^K fob 
J e c t mode] til, ±»SBj&S«fflpTffiT?*>^. 

XHTML/7>)fOJ Script'f^- 

t^T> h 5 J: pft^p 

object mode 

•P«r ¥4 T v ? ff£#fc* x 6 fc a6 to £ "J ffiixfc J«ffi SrJt 

£if r7y/^^y7^f (u p ] e v e 1 b r 
ows e r) ] tP^^^o :«^7^f(j:, VL 



(9) #12 0 0 3- 1 3 1 7 7 2 

[0 04 3] t£^g{Sil % Tdeclarative ra 

j$ £H<5^|r§\ declarative mode T'St> 
ff-J-S^Cr-^Ht W^U^/l^^f* (down! 
evel browser) J £ WlYu -f^^x ^ > 

f!i^^l:TOL, SMIL (Synch 

r o nixed Multimedia Integr 
ation Language) 2, 0 & Klfffl 

[0 0 4 4] K^A*Klol^T»Cfc 

fill r ^-e- F'tof-^A^f-fi, «R 

30 -*?l$mi IC7^- /L- K2 5 2{C^ h^7— K#^" 

IrA^L. ^C7^-/UK2 5 O^c^ K<7> 

T^M7^/WK2 5 2[wIot, W:^fzAt)&i&lE 

^ modal] irispfftt^p 

[0 0 4 5] W*B 8 t % HTML^-^7 

&J;U^* y /l»2 7 2*S**H^ $ ^>fcrEl 7 ic 

^■gr*HSo #7^™/VK2 5 0 s 2 5 2, ^lt/2 5 
4— eoflJ«[<7JA**ix ^K-e*tt=3— KSS^2 8 0 V 2 8 
2, *5i:t/2 8 4 {^i or WJW*fc(i*ffSix^. Mc® 
50 {z^~ K®* 2 8 0 & L, t ^f^X-Y 7 3 0£?) 
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17 

t, Y T 0 n C 1 i c k J tfSjES&SitK wfrfl** 

^^f»2 7 2 xmwc r t a i k j «rP««iai-J&** 

Fmu 1 t i mo d a 1 j fcPfHSv »^A^&££ 
[0 04 61 #<©7^!I^>g^s y— 

[0 0 4 7] WKSCtfcteJIS i , JSttts &x 
£H#o ) *9M**-ctt:, r^tifej *-**s BSft^ff-F 

#tpfrbi^W2 9 0 CfclBifflS-Ctt: f r e c oj £ L 
ndj ftWT^HT^^ 

0 2i)^h^y4T^ M^^n — K^^6^" K4:W 

fc^l^W-y— 0 4 fcSfeft*. gramm 

[0 0 4 8] W^Hfct^ i***^*, ^ 

^ 2 0 4 ;&>^<7}|8*iJg 

K1=>»£* r e c ofMM2 9 0«/j\ « 

JfLtf»^Tt6fc, recount 
^ffiibfb L . Rtt ^txfc-r^xhi: fctjfc-f 57^-^ K 
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/£ 

U «-7^^K2 5 2^5^^2 5 4 lC-I(50r e c o 

N£§ftlX££s 7^-^K2 5 2^J;^254^ 
jh,fcH«WltfeiT/S« *-K#t7>f^K2 5 2©S 

<^)^T, IS fhand Uj K^^^SrSSP 

[0 04 9] — ^T^^2 00*SJ:U^ 

;o »£\ 111;, ^itbttSW^temntttbnfc^^- 

fea#**bfc.V\ I" o n C I i c kj ft^C^iSt"^ 

nCl i ckj -f-<V hcoffiffl^RR^S^^feco^l** 
20 k&mB&fct^\ fonSelectJ 4^ B 

IS ^ilW t57 P 7 t >f± t? + ^ ~? n ^ 9 A t o 

2 0 4 36*b©«**»fP^J; V/*fc tiffin Srsfit 
^6t)^S tart () =3— /V % ^fcli^^>-U-<^y^ 

[0 0 5 1 ] tta6<0«T*f4, ^yu^^-^wr T^y ^~ 
fe(ft<arj#»*+5ri^j:-=»T, ^<-Y^— ^A** s 

B»*w<2 0 4*fc*±^^-rr>'^3 0fi, bk 

S: ff it U T ffl* Sr ^-t" H# ffi ( AM W « Scffi * ^ t fcy < 
t^i?)*)5Pl (dictation) *5j:li c ftOv' 

50 5) s fl^Wft#ih««t^BFti-SWt"X?35t<, BHt 
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[0 0 5 2] —$mmWr?&s R e c oK*fl g<D£ 
^T^)3o^Iij|^ K%EgiJ-r§fc«>^ f ffl odej 
To n R e c o J ^-O- hO^#t^fdfl fb indj f 

TautomaticJ t 

[0 0 5 3] Bl4li, *J*B«e&fc«>© r a u t o m 

T»# <5) o BtPlHiM 28 1(1 BKW— /< 204*283 

^2 0 4^2 8 5 T7»l*«rKkttl U 2 8 7 
TLfz^b U fc fc % ^ t? £ . 

[0054] Reco Jfejftttll ^lt^-^< 

2 0 4 liifinitialTim 20 

e o u t J 2 8 911 mm<Omth2 8 3d>k*J*(0lft|±| 

#H*Jfc Lfc^^r^x.^ To n S i Uncej ^-<V h 

2 9 itmm-v—^z o 4^e>«#t**t** /< 

2 0 4*s»K'Ct3ftV^38IS«:Jlo*t5i, I" o n N o R 
ecoj >f^h 2 9 3^8fi ! Sn5^ £*W*Blfc;&5 

[0 0 5 5] B»fc(»ih*fctt8it>*i-C:4:*s-C*^* 
fOMi^fe: rbabbleTimeoutj JR|£ 2 9 5* 

j& H±J L ^ %gm & K £ * *U*tt £> 4 V * H# P9# -c * 

4 LTf^^lLT^5^ fci 

KlgV^lgcD*8\ TonNoRecoj M# 2 9 3 * 
^fT^ttS. fcfcU ftl^lWlST 1 fbabble 
T i m e o u t j JKtt 2 9 5^tfcf^, BttttBO 
^ — T'&& *TEgf£*^< „ TonTimeou tj 
^^ 2 9 7 *»*TSixS, WI«K, r m a x T i m e 
o u t J Mil 2 9 9 r £ jfrlpTtt-e* U . ^tl 40 

in&m<nmit$ 2 8 3 a* 6,*** ^7^ryh3oi:i$ 

n5*T?<7>S»IB#tC|ai-5t)«0-C*>So ^ CO B* n # fe ffi 
5 £ s UnTimeoUJ h 2 9 7 *^|f 

[ 0 0 5 6 J tctd U TendSi iencej JKtffe 3 
OUf? fc*^M*flBfrfcjB;ifc»*{l B«#3ET Lfc 
- t«rl»t^-*-t^'t > »)9, BBi>--'<2 0 4f±RIRir 

14 , BUMS* ^M-T -< * T*> 5 a* if 5 A=*«*:f a fc * 
O^iflilj^ (confidence measure) 50 
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fonNoRecoj Iffi2 9 3« 

fr£;fK JfcflljBftdsutv^J: 9 fcH^g^ll r 0 nR 
e c oj JHte3 0 3feitJWaiie**»ff mi 
4 il Tautomatic modej T\ 
stop {) fr&&ft£tlX\r^j:\t^ <t^^-f 0 
[0 0 5 71 01511 2 0 4 CD f s i n g 

ie mo a e] mft&^-rmmmr-hz* r a uto 

WStit^S, fcfc"U rcolljf^^—KTfl sto 
p () 0 5*B#Mj»2 8 IJif^S^T^^. 

stop () ^-/^3 0 5*1 r p e n- 

upj te¥<D^s<^ hteW&T2>h<D*V&% 0 ^(DWlif 
^r- KT*n BB***rJH-C fcsSS, W*S&te stop 

t I^^iC rinitialTimeoutjPf 
Pfl#2 8 9rtjennasii35i;v^»frtt r o n S i 1 e n c 

Jl BWaffJfcSftft^, stop 0 =f-/u 

3 0 5Em©»I"e*A^B{i:j:cT4j*S*i(S: r D 
nNoRecoj ^ -O' h 2 9 3 fl BS^^ffjJb 
V\ fc/fU rbabbleTimeoutj Mtt 2 9 
5 &1t& TmaxTimeoutj Mtt 2 9 9 (CBBitft 

[0 0 5 8] Hi 6 H fflm-V— '< 2 0 4 CO r m u ] t 
i p 1 e modej tt^Sr^"M^li0-efc ^><, fj3i"C0 
^ ?t-. wO»^— K*i To p e n -m i c r o p h 
onej <Dm^tlita^>^-]) $-T«&ffl$*h& 0 —ft 
{dCOl^^ KT^t, »^W*s t op Q 
3 0 5*§ltFXbtt^^\ SfcJi r b a b b ) e T i tn 
e o u t J Ktt2 9 5 *3t(i TmaxTimeoutj 
JKtt2 9 9(cHai#rte>ixfci«B#*:tt^e*-e, 
BB»*tfK*;h,5. fcffL, BBSrffih U*v>ttito 

fonSi J encej -f F 2 9 I v ^onRec 
o j 4 *<> Y 3 0 3. fonNoRecoj 
V h 2 9 3 ^babbleTimeoutJ «fH 

iti 5 r ma xTimeoutj S^pal^^cO ^ -^li 
Wiry hS*t*wt«c«JR*jh,fcv^ 
[0 0 5 9 ] -~fl£f-s coibf^- k-cii lf«^4afc 

^^Co^t UnRecoJ h 3 0 3 &Wfs $ 

stop () 3-/1/3 0 5*gtf]& t)tl5^T« 
*KSjK5. BKfc"C#*V^«IStCJ: !) fonSile^ 
c ej h 2 9 i**fr*;h.fc»«\ ^ix^>c^-f-< 

>-H**# Shifts, BBitatT-rs, 

to 0 6 Oj m^(OX o 7 -i—/v Kffl<OBBiittttk 
tor e c o^-^*^ h*K»*iX, Z^iC{^>^< 

it, jcfesrffiffli-^BK-y—^2 o 4<om^^mpk-t 

%Zbfc%t*L&* ^commit, fy^T^h'So^m 
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tmn-vn, iwt^2 0 4iaoTSij»^5f 

[0 0 6 1] BBl^-^2 0 4 #>bBIRtt**:9:Fra& 

2 0 2Kl3il&H& 0 ttit©ft*J&*fc> We b Ihw* 2 0 
-^2 0 2Xteft< L5^i®5i^*"/^2 04iaoTH 

(TSfta^fcjPWfej&^fciS. fc«u *»«tt, B 

^_^ 2 0 4OTe b*w<2 0 2 fc*iaSBH£ft£ 
£\ 0 4^^7^f7y h 3 0C0— U 

0 4^We bi^—^2 0 2&fcl3:? 7^7^h30 

[0 0 6 2] dJ 8 (C7F$ixfc^^(c^H^UXV>^ 
^*s % r e c oMftidtt.. BW*#7*^-#£B»*- 
0 4l£fa#Xi££fc#cD, y^-fWlt/^x^ 
h (RAO) ^r^tpr £ tt*§5, RAO^f^^ 

±7*?=.? hfo-r&z tcnm&ft. %P<4 

-So 

[0 0 6 3] B9*5J:WBi Ott. *W«B»*elHltSit 
fl;ifcHTML£ LX^iI!Xfc5 0 mfrbmbfrttX 

ftfUAfcKSJk i"ft*?feb a r g e i n # ^6jRtt&l£tf 
7 P ny7 , hW3 0 3«5o fc*TU Bl 

£ 0 Z.ZT:&"?n±Xl$. ^7^»^K^#gl, # 50 



#H 2003-131772 
3 S ^^y^hllife TcheckFi I l edj |c<to 
^^itW»fflr$X&oj&^sBBi^^<2 04^ 

is3>X^ r 0 nNoRecoj tt, BBUXV^jiV^ 
c fc*»i"Bfc:a^*ix*:7-Y— K^n u 1 lffi&A 
;ft,5fatX&5£i£>s **u«±0»fWiiK»ftV\, 
♦floltlBBB-ett, TonNoRecoJ 3 0 5^1 
ffmumblej SrlWffii-J&^Sfctt^ffU, i£^&r 
B»* ta ^2 0 4 fcteaiLfcSL Ctl^^^T^* h 
A^f^©lft^fA3 0 7 (B5) ^lt, 
*l»^*aiStu5 a BBiT— v*2 0 4OT*^ h 
*:«B»^y?^lF2 1 21CIU fttcrftjWWSS 0 
fc^3ftT=--lWB< W«Jc, *w 

fc . t # fcBBW-- '< 2 0 4 K J: o X WIS* h y 

[0 0 6 5] rtO^X^s BBS f w e l c om e j Ikfr 

ckFi I ledj ^f{^LT#7^- ^Kto 
V^X^07 !, a>'^ hfettiU A^^tLfc^ KWKtt 
43 £ iflJHRj&SE L^ri: ^BB*r*tfB«I*Xift*rfii!i 
tOTifcoT, CftlCte fconfirmatio 

[0 0 6 6] &&ffl<ni&<Dm$&t Lt, S^5^^^<?> 

{I^IC, We b t-/^r ^ y > a ^<0M4»»t 
4 < »# iitf - i & Xt « o^tf Ux*> 5, HI 8 & £ 

HI 9 . H 1 0 BB« r h a n d J e J SHfcffl LX-^J 

[0 0 6 7 ] g|9tC^^$^X^^?S^^, ?-^7yr 
IlliCfl DTMF (Dual Tone Modul 
ated Frequency) MS*3i;«l»Si 
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*Oit«MB^SS6)«C2o*>^ DTMFiir ecoifl 

[0 0 6 8] Hi l^cfct^tai 2ft, 

**t*itf-3-"-i?lc:»Sr|lltt"r« J: 5 i^lif^r 5 r 4 

fttftCtm*. r^3)Htft^ (mixed 

initiative)j <£>— t&l"Cfc6 0 — JSxtdCltD 
ifetl^MM (dialog interaction) 

[0 0 6 9] mi l&XTfm 1 2 0»lrtlt Td o_f 
i e 1 d j t LT^SftfeASfc, Tg_ c a r d 

typesj, r g card n u m j s £3 £0* 

Tg_e x p i r y_d atej (CflBaWit feilfcflMB 
ClCD^jT'ft, r 0 nRecoj 

fci^-T^rcoT' >r -/u K(cH+^«*BBiiW«1' 5 r <k & 
[0070] iS7 % HI 8, H!9 V 010, Hi l , fci; 

xmi 2 Til, ^M'^iam^tzwebmmy^-^rp- 

*(dffliioT^5, $e>id. f-?**i7P-INKfc 

*fcJ±«*^^^y^^ (mo 
da l i ty) R (GUI We b -X, 

rv^ 0 ^ccmaot, M^f/^^ 3 o <tpj 
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tmmh^mKvx^^ itmAxn, ±mxm^m 

[ 0 0 7 l ] JiiBOJ: ?^ T ^/^y? ?if«x 
9 *TSfc»^ ±E^«TB8« Than 

d i ej &wx*mT z t &&t*m* #&m&m*nffT 

/<? 5 J: 5 Id, fbindj K*HB«8S**r»WLT«[Sr 

W!)at§t^t*oT, rb i n dj mm* r r e c 

»M(Otto>***«*fctt1li»-CH, Tb i n dj 
filSfffiS* ( r r e c oj MI) ffcot, ftWJt!? 

20 b&X%5>* 

(0 0 7 3] x# V 4 ^f&tizmL.teb^ 

— -><2 o 2tc5li-r Iff L v £ 
^7^rr> F r / V ^ [:llft ^ w i o t cO^M 

[0 0 7 4] tt*B^iai»1lll?tt. fbindjj* 

*dflJ!3 STSfcftOjRtt TTargetElemen 
t J ^b v J:^ TTargetAttributeJ 
*-t5o f&<?>^^T^, Tb i n dj TTa 
rgetMethodj t>^A/"C*5 9 , Zfr&fry^x. 
? y y K*aL^K:iattiS*b5, riargetM 
e t h o d j 0^^5j:i>m^l, 7s ^ y -f^r^iy^^ 

flSffl its ^-^^^^ h tob j i j co rxj ^ y -y v 

<bind TargetElement = "0BJ1 " TargetMethod - "X"... 

> 

[0 0 7 5 ] »fTvT^^mHTML/XHT 

mlj^> hmxw&y k<oxh%%K SiMfr^ttiff* 

"Cfctttf , W3C Document Object 
Mode) Level 2tfc[tLe ve ] 3 -f 

"S^WM, ECMACommon langua 
ge Infrastructure (CLI) -f^ 
f^^>. Java (SftAtR) yp^y^^f^ 
5(? ^>-F€f>, W3C Synchronous Mu 
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ltimedia Integrate onLangu 
age (SMI L) , *3 <fct^5 0 ?&$t &4%5W 3 C X 

OXfefe^, <b i nd> 

[ooze] hi 7&£tmii 8i*„ ^-rr^K M 

ffctfttCte: downlevel browser ±t?^ff 

[0 0 7 7] — ASIC, mi 7^iU«Bl SO-r-^T^y 
4, 3 5 6, &J;TJ*3 5 8*****l«o * V a V 3 5 

y - a $ fc mm b&xs*iitk # iiiiunex & * ft b* 

PJtrf*?*-9 b tsmexj OTt5«(|it^^3 
[0 0 7 8] WSSoi^fc, Tb i 

n d j mmfcitt? & h ^ y y k«±s 

CUh^* 3 6 1 Tr Twe 1 comej ^ b£> f s 

t a r t j ^y? r W e 1 c o 

[0 0 7 9] #c^^— 1^3:, 3 6 2 T* Usked] ± 
FOfstartj ^7 K£r^tri~& CI t 
J:ot v I'D o you want coke, cof 

fee or orange j u i c e ? j t fc^fa 
ibtb5 fl 3 6 31? freco^dr inkj t 

~fi?^$ hor s tartjj*yy K&*7J«tt!t*C. £ £ 

[0 0 8 01 iSfcRHs ^ v- 3 > 3 5 4 <r>^~-£ T y 
ff£iV IW-^2 0 4 «lt^»\ Xpa t 
h — h ^ >■ h r. /d r ink typesj 
otW^H6 0 Ctf>«|-efi, W3C^Xpa thai 
^iltV>S^ S»8fft0Stlf#T?**xtf^ W3C 

xMLKti (xqd £^&^zmzmfe$n%^ 
tiote^tiMom mws * ■em.* *r a: tf £ t a* m ^ r- 

to&Z-ktrmMir&zblz&MZtilt^a r bindj 
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3 6 6f/pyyht7^^f Trepromptj 
^ «V>T3 6 8^P^bt^K^h faskj 
j5S|6fr£*V 3 7 b T r e c o_d 

riBkj #WmZ*ib* KStbfcMHWM** VMBtt 
#1 0 £9^^ r c o f f e ej <D*S\ 3 7 2t 
Tdr ink] ^Hfflm^ffi^»J 9 ST fe 

tt, ikv^-ea— if ft, 3 7 4T% 7 p ny/ft^x^ 

/0 b r c r e a m_s u g a r j \Z. £ oT^ L? — A&ltl* 

Treco crea m s u g a r J # s P^I>*£rj £ tl 

coffee TNfeWll^ 3 7 8 lr*ft drink 

b T c on f i rmj £|gfirU ^^ft-fe^S^a 
>3 5 803 8 2 T^M^^v^ ^b r r e c o_y e 

snoj *mmfr^kxc&<>x^ 

nksj *SW4**L, ^^3 8 6T'fSW^^b 

^ tz itmm.m^mmm^ ioj:*?/j^i 390 

^cy^ht^x; h T r e t r yj jft^KffSti, 
t)5-S3 9 2t7'py/ht7 r ^x^ b T a s kj j&* 
Iff^HT, 3 9 4XmM^^^^ h Treco^d 
r i nkj *SPf VttJStva. 

[0 0 8 1 ] »*<&OTj&*fcfc>j&*5J: p tc. fb i n d j 

JC t^>ay3 5 4, 3 5 6, ^fc(l3 5 8^^: 

[0 0 8 2] «i<0jllB»liT?tt % ^ y s/ K^I^C^jtl-fc 

<bind TargetEtement = "OBJ" Targetl^ethod = 
T"Xarg>X</arg><arg>Y</argX/b i nd> 

fOBJ. F (X, Y) j t««"C*>4*\ fo^>^ 

*± roBjj ds, ^^^-j?*fcfMII» rxj *3iti« 

Xfa% 0 

[0083] TbindJ K*ft. bind ^^1^?^ iT CO 
-f^y^tfe5^Ift5 feventj JB*Sfe4r* 

50 <bind event = "onNoReco" = TargetElement = 
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"prompt"!" TargetMelhod = "$tart"/> 

i±, r D nNoRe c oj -T-<>- h&mbtltzt 

ftt^^^T^!), tir^f*?** h fpromptlj O 

r s t a r t j > y K^^a^ti^nir^^^. ^1 

i!i£r fRecoJ K*W»*ttTtfflt6«^i:8 
JH4tt r o n R e c o j "Cfe £ 0 

[0 0 8 4) Ktt«R£ tro rbindj Mfett. # 
&t>\c s rbindj Mit z^-ir^w^ 

£ Utatusj JRtt4rWi"S v r o n E r r o r j 

b^tf n t s« rb t n d j m^omco 
fc: t> Bitter ^ir*T?#* 0 

[0 0 8 5] BHilO^fiy^tS: 

*tt*5j:V Waluej Wtfl)W*tr, ffrtt K*»* 
V b by— KSr^-f fhos tj /P^f^^ 

3^3 5 4^ if tf* coffee £rp5£ Lfc £ # te 

^ y - a * fc *MM» * mm -r % a* ^ 5 a* & a fc & 

tfc^ol, -*—<?Ty7 Th o s t () /get 
__dr ink/dr ink- r c o f feej J 

077 d r ink7^ — ;U K£S r c o f feej 

I0 0 8 6] £fc s Tb i n dj mmit&ff-V— '<2 0 

#te, 5^dS|tffS*xSp t^->3>3 6 0tlt 

Tb indj ®f|ffj\ j< ^/ir— ^ f/C a 3 I con 

n e c t e d J #K#Rt feftfc £ & f w e 1 com 

ej Zfv^-y hSr*??*fcM:W£U r re co_dr 



(15) ^2 003-131772 

p/^^^tf. H0«te, {t^A^^fBi^P^i-^^^ 
7 B P/^f ^ hnnertextj ^rffiffii"^ r t fc X 

70 ^^fl fonReceived] £ £ ^"BTIIBT?* 

^'<^:*±^|lfT*©r:/y ->=j » &y*yiyy-\c 

[0 0 8 7] L-fc^ot fsmexj y*— 

fcftr^y -{r— >a ^£Tte*T?t5«fc 5ic-*-5 tot? 

o J , r g r a mm a r j % t.tzi'X r b i ndj /£if<7> 
y-fer — v^^fcfi fsmexj t/yx^ hiCO^^Tti, 
30 [ 0 0 8 8] rbindj B*tt, rforj »f 

<bind for = "prompt 1" event = "onConip I ete" 
targetElement = "prompt 2" = targetlfethod = 
"start" /> 

hTprompt I. j ir^A^^Y T 0 
nCompIeteJ IriMit Vfz t ^-/^^^ h 

prompt 2j Ostart/y? K^Pf t) ? ffi 

[ 0 0 8 9] ^FgH] 5 ^#^1^-5 £x Web ^ — /< 2 0 
Mi crosof I Corpora t i o n(Cj:^A 
^6(/iCWe bi^— /<2 0 2*CT^i2^«^co^ v-YT> 
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— ^*fcf*A<— ^*r*tf£ 4*«*C*5. 

rat, fww^^ y^>?^^3 2 o^s 
[0 0 9 0] ^7^7yHik-^r^^»^ 

- 4 'MM^^W y^e^a^a 2 Of 

ftffltSASP+^—^J^ 111 3^$|Xt^ 
[0 0 9 2] ftMA 

ybXh%* HTML, XHTML, c HTM 

L, SMI L, WML, ^WJ:?^ttt©SGML^ 

XML, tt£Z> £ 0 Hl£ft£H5 c *W*|B»-C«fflS*l5 
^ ^{3. P» hWHR e d m o n d OM i c r o s o 
ft Corpora tio n rt^pilTflK ^HT(r s & J: < 
^D^tlfc/y^KtJf)|), SAP I 5. 0^|S|«-efo 



(16) WI2 0 0 3- 1 3 1 7 7 2 

[.0 0 9 3] #KJW«^»Cfc±fclS»tt* KT*>«D 
<prompt, . . > S^Mtei^P^T' 

< r e c o. . . > wmmm&ttbxfKwmiDmft 

10 <dtm£. . . > DTMF*r«**il«»HlPi-*» 
[0094] 2 Reco 

Re c o|*j;s RniftA-fAAisJ;^**** 
X, 3£<CJB|SI*< g r a mm a r b i n d > 

[0 0 9 5] R e c o^B^ Star t*5&X$S t o 
20 SMI La^yPtlWUSMl L^ffi^Sft^^ 

#ift\ /p^7 Mz,m^^xmW){b£h,z>o 1?± 

[0096] Recofi Hfir"Ct 5f&®^"?- V Jr^U 

\C S #^C0^ — h\ 1" t£fc>% Uutomaticj, 
i(? f 3 i n g l e J % l^tclZ $m u 1 t i p I ej «r*ffl 

[0 0 9 7] 2. 1 Reco^V-r>- s y 
R e c o^tc^ 1 otfc(iaiR©Xi»*5J:tW£K3B 
tR-Cb i n dWSH^-fey h^^H, m^nIES*& 

[0 0 9 8] 7 5//^^7?f"efj:, Recoil 

40 {zxm^(Di-^x(DmMmmn^m)y^mx^^ zttc 

[0099] 2. 1 , a <gra mm a r 
grammar^^^, srcMMt^^H 

*fc»#Ra»^) ^^^-rvaciStts ^ 

TJ:V\ SS<7)g r a mm a r ®^lc:lt^-f"6 Z t&X 
50 »c?>g r a mm a r K**S**$Jh,5»fi\ 
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10 



20 
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[0100] Mtt : 

* s r c : ^yy^f^^fS^HSi^ 
it-SB 

■^y hflxm 1 : lan gJMMCflEJ. fcix.^ 1 a 
n g I D™ r e n-u s J *l*:H3£ffilS:^ D ^OJStt 
tt, langl D^tSUR I fcftjfe V ^J^*§-fr 

I 0 1 0 1 ] 1 a n g 1 D*«R«fBfT?»*S*b5» 

Hi d*S3tftttHrt-e»3e*n5) <0#:l£g r a mm a 

(grammar srq= ff FronCIty.ittr /> 
or 

(grammar) 

(rule topi eve Inactive") 
<p>froa (/p> 

(ruleref name="ci ties" /> 
(/rule) 

(rule naHte^cities") 

<1> 

<p> </p> 

(p> ^Th;v </p> 

(P> D>r*> </p> 
</!> 
(/rule) 
(/grammar) 

[0 102] srctl^WiJoi^y^^a* 
<sm I conf i dence="4(T> 

<travel text=^>7" Ka^£>^ h>J£?r£fcvO 
<origin_city conf idence="45"> v-T Yfr 

</or rgin_city> 

<dest_city corrNdence="35"> 

</dest_city> 

</t ravel) 
</sinl> 



32 



[0103] 2. 1. 2 <bxnd >^Sf 

[0 10 4] bindg»aoT»^»f 
(SMI) SrttfXMLK^a/vh-CAotJ:V\ ^ 

ML h^^^yh^MT(^t, 



JO 



[0105] ;£&T-<Df&imte, t^^^7 7/f^ 



5£> 



[0106) JR& : 

- targe tElement: <&2! 0 SML^f^^v 
a 1 u e ^>-^>^^fij»9 §Tbh5^I (W3C S 
MIL 2. 0<Z>»«»£R«K} 0 

-targetAttribute: flfcflfiBR. SML 
hl^cOM (SMI L 2. 0£T)a tiribute 
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£\ T^^KCDfffif* r v a 1 u e j 0 
•test :«^514Ro »«JS**««!?Sr6>n*ft# 
Sr^tXML Pattern (W 3 C XML DO 

e o 

• v a 1 ue : «9L 9—*fv FJI*lC« D 
«|^*K**^>'hd*&tf>«4:*S*i-S, X PATH 
(W3C XML DOMtgt^^i) 3t*?'Jo 
[0107] i; ±lgSML^H*?;6S**_k*t,S£, £t 

T^Or e c oHiR^ % or igi n c i t y^^t/d 

e s t_c i t y F*3tf>ffi&. 9—*fy h^—^KSS t x 
tBoxOr i g i n&JttF txtBoxDest 

<inpiit name*"ixtBoxDest" type="text" /> 
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-raci t^«r^-efo?), bind» 

AMMfefri: IT, d e s t_c 1 t yttA0ttflrjJM±i£ 



20 



<reco id="travel"> 

(grammar src^/city.xmr* /) 
<bind targetBlemeat="txtBoxOrigiB" 

value=V/origin„citr /) 
<bisd targe tBl emeu t^UtBoxDest" 

value=V/dest^city* /> 

</reco> 

[0 10 8] Z-COb indlt j^T^WIc^ £ J: 0 * 

<bind targetEl ement-"txtBoxDest" 
value="//dest_city" 

test=Vsmi/dest_city[@conf rdence $gt$ 40] 

/> 

[0109] bind WWlte, downlevel^fc 30 
T^lf^-^^— Si" 5 r e c o DOMt/-^^ h 

[0 110] 2. 2 iU^^O^P^xw" 
h£*V5o 40 

i o i i i ] 2 . 2 . i m-jf 

yT<?)Reco^Rtt(t ^7c^-y (dial 
o g turn) tC|»UT*]*lBI«««*«fi!li-e^^ 

♦initialTimeout: KttMtt 

h7^— o n S i 1 e n c e 4*< s s h*)«HH&£H 

5(2. 4. 2*#fiS) 0 ^AE'^tl^V^I^ s ©-^7 



•babbleTimeout: ffiitiitf^ HMttltd 5 
fl#P^#c automati c ^J^U 5 single ^r— K 
^CT*IB#K»SSix3o rmul tipleJ^K 

IH^K^B-frttiffi. status*— H 13 (2. 

■maxTimeout ; ttjMBJl, RRHtt*>&ttA 
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i P 1 ej KTO r e c o <OWr&^ babbleT 
i me o u t *ttJBf £»*±:n«lc, ^(Z>^#^, 
-tn-Ptt^)**^ J( fc fitted >- h ttl^W 

M £ ma xTimeou t Mtt^ i n i t i a 

lTimeout^it^babbl eTjmeoutC* 
-frit J: 9 ^#V^*fctt*L<*lt*i/(ift6>4v^r fcfc: 

•endSilence: feEliiRc a u t o rn a t i 

c Ktr<DR e c o <*>*fi\ *o»BIMttfta*JKS*L 

a u t o m a t i c ^l^-O 

• r e j e c t : famMtfto L*V o 

^server: <£«il#^ ^P^y y F7^-AOU 

erver=protocoI : // your spee 
c h p 1 a t f o r m^5. 7 7°^-> 3 Vf£# 
t ^.tfftoj: 9 [CUR I 3t*W^BR*S:*?iJ*:il 

protoco I : //your speechp I atf orm?barge i nEnergyThresho 
!d=0.5 

• ) a n g I D : ffi#*K. iTO*«»^ Vi? 

mi : I a n g ? 0 UngID = 

a mm a r ] a n g I Dd£JtK*ttttV^»*«ClB 

ot^JT^^ (2. l. l «r#ffi) o 
-mode : ffi»51^ 0 il^1"6f«^- K 

t o m a t } c J — K <b -5 C 

fO 1 1 2] 2, 2. 2 -fv/Z-Tj 

f£ti£) 0 

• r e c o R e s u I t : 9 l£ffl 0 2.1.2 

•fcLfc-t WM^—?Ty7>nm (sml) «r*tf 
XML DOM;-Wvx^ hrtfcftftSftaiBH 
fiOJSa, 7 6 D/<7^1nu 1 ] 

'text: t»9/*&^ e BffiSfrfcMI^X** 
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coResultT\ SML»(RjR!?-e<^*iSfi£S*C9 

jo ttt^c: £j&s-c#5„ 

•status: S8ffi:*)*fflo fBSS^^ S> h — A£ a 
(t^>ay2, 3. 1) *3j;UtAc t i vate 

yy^K (i?^>=3>-2, 3, 4) ^<7>wffi&w#fc:s; 
WicRftStL^M-l X 5 (2. 

[0 113] 2. 3 ^v^^hyy^K 
20 Rec oOStb^feiOfg r a mm a r <DffiW)ik&. R 
e c o ^DOMt7Vx^ hfJlfcSV^O^ V y K3H£ 

[0114)2.3. 1 Start 

Bffi S:J» i- ^ L r 1- t <o tit «WJ «: » Wr««^ Srifi 

I 0 1 1 5 ] «X : 
Object, Start () 

m *) m i 

ffrj^- : y y Kfl, ^if DdD s t a t u s K* 

(fire) t6 D Rr**l»»^tt. Xm&K^ (r e 
co status—— 1). 3C&(D ^ MMM t 

^ URI«U^ (reco status=~ 

2) . £tLl*&-p~77 y h7^»A^X7^- (reco 

s t a t u s «- 3) 
[0116)2. 3. 2 Stop 

s t o p y y •> Kfi % f^^p^^^^T^-^^fc^o 

Rec o^-y^^^ Hlf?(?)|f?:|* 
lb U B«K«««rt*^fc*^rfc»-t-^B«M8*Sr, 
»**iffJhUfcl»A*'CjR-r o Rec o^«fflt5f-? 
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&W±*t%?>~t\ a u t oma t i c^I— FT^ftlKl 

[0117] : 

Object. Stop () 

ftt 

ftL 

[0118]2, 3. 3 Cancel 
C a n c e 1 ;* y y YVX, 

iLU S:»*#»iJiftbXBl»SSlt*r«»feb. ifco£5 

ftw»a\ =>-^tt»a-t**«, 

[0119] : 
Object. Cancel () 

ftL 

ft L 

[0120] 2. 3. 4 Activate 
Activate*!/* Ktts XHRBftX* (CFG) 
cO»Wtt:«RlJ«r»iiffc'r*o EMMW*, Ts t a r t e 

SMrfc:* TMH & ftttHfcfft e>fti>\ W»ttt;:#ft«ift S 

n x v ^ ft v ^jgmxm k h 1~ s -f oscs?* i^&Mfw 

(0121] W: 
Object. Activate (strName) ; 
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* ■ s t r N a m e : iStMfcSnS*#J0>*tt 

sso« : 

ftU 
ftL 

[0122] 2. 3. 5 Deactivate 

[0123] «3: : 
/0 Object, Deactivate (strNam 
e) ; 

- s t r Name : ML WSlMfcS ft 

ftU 
ftL 

[0124] 2. 4 R e c o^-O )> 
20 Re co DOM*^*^ bf*»TO>f ^ 

[0125] 2, 4, 1 onReco: ~ (TM^V h 

ft* , KSfcffiBtas y 7 * If isttJB -e * 5 WWtMfJK *r»fc t 
£tcB8*&&*u£o a u t oma t i c^~FWr e c 

U Uy^^tS (2. 3. 2&#SfQ o onR 

50 [0126] mX : 
[0127} 
[^13 





<Reco onReco ="hatidfer' t > 




Objectoalteco = handler, 

Object.CKtRsco 

Ge^efniandler"); 



[0128] 4-<^ V*"?*J*9 V 
[0129] 



ass 



[0 13 0] ^yF/p^f^ :^>M^K7{i AH) o 
T^p^x-f Srit*Srt»fcftV^ K7{if-^l- [0 131] (5i|;^TOXHTML77^>Ml o 

h^r/^** l^^^^o^X^^T^^# T^^P^ttiU MWft7-<— yi- Ktc«t*i) ^T5 0 

< input name-"txtBoxOr i g i n"' type="text" /> 
< input name="txtBoxDest" type="text" /> 
<reco onReco="processCityRecognition 0 "/> 
grammar srG='Vgrammars/cit ies. xml" /> 



-20- 



(21) 



2003-131 772 



39 

</reco> 
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<scriptX![CDATA[ 

function processCityRecognltion 0 { 

smIResult - event. srcEfement recoResu It; 
origNode = 

smIResult. seiectSingleNode(V/or igin_city") ; 

if (orfgNode != null) txtBoxOr \% in. value 

or igNode. text; 

destNode = 

sm I Resu 1 1. se I ectS i ng I eNode C//dest_c i ty"} ; 

if (destNode !== null) txtBoxDest. va \ ue = 

destNode. text; 
} 



[0132] 

i 1 e n c e 



]]></script> 
4 , 2 onSilence:onS 
RecoOini t i a ITimeo 
*S*tfc*3R»IB (2. 2. 1 CO 



* a ti crecognx t ion^- 

[0133] m-X : 
[0134] 
ME 3] 





<reco caiSilenc^foatuIler" 




Object, onS Hence - handler 
ObjecLoitSUence CT 



[0 13 5] Y 
[0136] 



2K 













status = -11 fcSa^t 



[0 13 7] ^-<> h/D/^r^ : -f-O- h/M^ 

[0138] 2. 4. 3 on Timeout 
onTimeoutil, jfeSW*w*i*^^7 y h7i— 

■ Bf»*£S5T-r5WtC«Tfcft-ofcnia x t imeStt 
{2. 2. i^#fis) Jc»*$h,fcBWB«btriiS«'r5, 

:^^Kt jfel&frctt, 'titter— *T2?** 



me out (2. 2. 1 «r#JR) fd i oTfifttStifcl* 
( i i ) o 

[0139] zo-t'Ohits m*tmi:&toi&\£'&x> 
rn-f, 
mx : 

[0 14 0] 

r*5i 





<reco oitTimeout=="hattd}er' J .„> 


(ECMAScripl <7^-^) 


Object. otiTtmeout- handler 
Object. onHmeout ~ 
GetRdHiandler"); 



[0141] ^- 
[0 14 2] 



[^6] 
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3 Mfci maxteti* J; oTfR® £ frit 




reco status &-12\zW£7£ 



to 1 4 3] ^^h/PA'f^ : ^-<> r-z^F^te 
[0144] 2. 4. 4 onNoReco : onNo 



XfiK status^- FiaotPI^S, -Y^V 
!^ ttlWft ^ n ^ ^ Sr 6 tc fflb-T & o 
[0145] P^C : 
[0 14 6] 









Obj ecL onNoIteco = handle*; 
ObjectOJtNoReco "» 



[0 14 7] 4<<ls bt^v 5 : 
[0 14 8] 



[3t8] 



status ^a^* #R)£U null gffl^M£r^i o status 

status -13 : #^I«W$^^^^^*V\ 
status -14 : % 

Z Z 1 tf> reject Mfc£#KI>° 



[0149] -<V h^n/*^-* : *<V h^y Kyft 

[0150] 3 prompt 

p r omp tMfi, ^^-r-^tti^^li^l-^OiC^ffl 
o T J: i \ 

[0151] prompt KfHtl. > U-<;U^^ £ 

D* i c j: o T , * ft & T y -f u / v y* y $ ¥ <r> ir ~f *J $ 

[0 15 2] 3. 1 prompt(^3^^ 



1*^*;* hSr«*-f ^*TttTJ: 



i P t fit, ^T<D 
^\ 

<prompt id~"Welcome"> 

ACME tCteKK^fc ftgfrV^t 5 
</prompt> 

T y & t> tc$#> 6 r t # 6. 

[0 15 3} 3. 1. 1 »^'-^7^ 

romp tB#rt»T*fflt5Ii:^^*5c (^7 
t-ry htt, 3. 2. HdfgilSLfc U t sj UttT' 

<DX&>%>o 

<prompt id^'gi veBaiance") 
PffiiC <einph> 5 </emph> 
</prompt> 
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[0154] 3. 1. 2 is 

p r omp tO^MO=f>T^yi^ p r omp t jd*tH 

[0155] Value S? 

value:tt«c K*:* * V h rtfcifefcWIfflHi 
Sr3R 9 mi"* 

t 0 1 5 6 ] JRtt : /<? 
-targetElement: ffi^^^ 0 hrefl 
fcte targetElement <^Vvf ;ft,;6*&*BJ£i~£ 

-targetAttribute: ffigSlRo Itt^flSt 

-href: 4*31Ro > h^UR K hr 

ef^targetElement COW^SffiSf ^» 
1q\ href ^targetElement ^H2££;ft. 
3. 

[0X57] targetElement JRftte, 20 
d7)S targetElement {£ J: o "Cffi^ £ ttT ^ 

(proipt) 

(value href=Vwav/beep.wav* /> 
(/prompt) 

[0 15 9] 3. 1, 4 MS^ny/F 

<prompt id="Weloome" src="/AOkEWeatherPromptsPelc 
ome"/> 

[0 16 0] 3, 2 *tt:feJ:U^n/^4 
p r omp tSStte. aT^Mtl WfP^U^fr-fyl} 40 
If) teXTf-fus^T 4 (/^^^W^-yT'^ 

[0161] 3. 2. 1 Mtt 
- t t s : ffitSStRo Sa*&**^*«K:BBi- 
57-^7 7^1^^^ b<nmt tsap 

I 5J o 

ft«W, •flRSF^^»>'^ hcoUR I (3. 1. 
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targetAttribute UttSMEJa L, 
T, targetElement tC^S*flWfe^»3tf 

p r omp t TtxtBoxOr i 

g i nj mmte&Xf TtxtBoxDestj mM<0 
fv a 1 uej JRtt^T 1 ^^ H££PA£*V£ 0 
(prompt Id^CoafiriiO 
ff<Jfe» 

(va 1 ue targe ml enen t=" tx IBoxOr igiu" 
targetAttribute= M valiie" /> 

(value t arge tBl emen t=" tr iBoxDes t " 
targetAttribute^Value" /> 

(/prompt) 

[0158] 3. 1 . 3 ^774/1- 
v a i u eSDlii. -g*J$ £ Jxfc p romp ttf>ft:b9 

UATO'PJTfi, p r omp t 9 *C-tf— 3/iV^5f 



b a r g e i n : flEKiRKo S&. ^V/FOW 



r g e ) n IttfcfflTtftVN Ba r ge i n=0 
f*, SPB#b a r g e i nW*IffiX*hZ> c wjfr!*. ^7 y 

^S^^iX5 0 r e c o **B)te3;|xfcl*/ft"t\ *-!7- 
K^fcft^^U^— ' <— ^<Db a r g e i n$£K<D¥h 

• p r e 1 e t c h ; ffiRilR, -<-v^n- K&Hfc 
££SC, prompt !:7 r 7 9lFWT*Wf%lZ&m$3j:Xf 

^ 0 x^A" hO«fi f a I s et^5 6 
[0 16 2] 3. 2. 2 T'p/^t^ 
T ^if^, promptODOMt/t/ 

^ |^]t»T07'DA 4 r>r M™a c 

■bookmark: V Lfc**<^*R!E 

ho 

^status; WzWl*}W-Bo "grpy? y h"7^- — A(C 

* Innertext : MS>J 9 #fflo 
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prompt (Dv-^c^ b&Sftffl^ (transcr 
i p t i on) SrlMfeU r*b*s*j*!S|l(cjSe>*L* 0 
fc^^S^ p r omp t^FTBtjft^r^/WW**^ 
;^D^r^ll^p r omp t(Of 

#V\) 4rffi#?U »V^fcfc£tf, prompt(D7^r 

T\ »»Stt6d\ * 5 ^ttntfttffl inn 
e r t e x t /n^r^fl »W = V^i^yir^tf p r 

[0163} 3, 3 promptly s>K 
p r omp tS4ft, p r omp t^DOM^v 1 *^ 

5o -OZfrSrmv T^T/U^/l^^lf^ prom 
p t^-^^rc^ h<DMlt& (start) &£TI&± (s 
top), Mfr^(Op r omp t CO— Wf*Jtfc (p a u s 
e) *5j;tffflJB (resume) . *fc WC-frjftStbfc 
*^03S«*5J:tJf«*©«S£ (change) &Hfrf 

10164] 3, 3-1 Start 

p r omp t©W£*rlMfrt-6c 3IIMS**fc:h,fc^l» 

CD P romp t±-?&=? h<Ofy&ffi&<Z>mmz. 
m^^ntz] fc^ftSftSfcifr, Star t^itlKU 

[0165] *»rftr : 
Object, Start ([strText]) ; 
/^^— : 

•strText: ^AttC^aiA SiT,*^** K # 

*t 

M# : status tdRjCI"* *^y77 
^^icioTfCtJIKSK-CV^Wtto n C o 
mplete-f-^V hft s M*&£ix£o 
[0166) 3. 3. 2 Pause 
*J*/<3/ 7 7 ^ 7 7 ^ v-^^^lCff^^r ik"f 

y y K(i»S»-C*>5o 

[0167] : 

Object. Pause () ; 
RSffi : 

<html> 

<trtfe> 7'a>yhfW </title> 
<head> 
<scr jpt> 
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mtv : 

[0 168] 3. 3. 3 Resume 

i^s— «Fffjh*ttTv^ftv^»^ rosy y y Ktt*Bft-o 

[0 16 9] mX : 
Object. Resume () ; 

[0170] 3. 3. 4 Stop 

$\ rcsy y y FiitF^7 7 r &y 7y^>^Wff 

[017ll«t: 
Object, Stop 0 ; 

: 

Mfl* : 
*L 

[017 2] 3- 3. 5 Change 
S40lt^j;t>y^fcij:fi«It^, Chang 

e « -M^ * ec ifftfffl 1" r t & -e 1 6 o 

[0173] : 
Object. Change (speedy volum 
e) ; 

. 

30 -speed: ft^i" 5HI$C> s p e e d - 2 . 

0f4iilflE^iiK*:2»{Ci-Ss speed = 0. 5 ti*S 

•volume: (S3Ei~£BMfc, v o l u m e ~ 

2. OttWtOflFiSraflfflC^Ss volume-0, 
5 ^J,^^^^-^>lC-r6s v o l u m e - 0 \tT*7 

[0174] 3 . 3.6 prompt Oft!J#P0J 
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<! — 

function checkKVIBargeinO [ 
news, changed, a 0.5); // TO**? » — AfcTtf 5. 
if (keyword, text = "") { // JS*tiW«[*»!^*>5o 

news, changed. 0. 2.0); // ^ V AftTclcHt" o 

keyword Start () ; // 
} else ( 

news. Stop 0; // KJ&Mftffl ^n^^h^ffih 

1 

) 

// 

</scr i pt> 

<script for-"window" event="onioad"> 
<i— 

news. Start 0 I keyword. Start 0 ; 

// 

</script> 
</head> 
<body> 

<prompl id="news" bargem="0"> 

- <!— 
</prompt> 

<reco id="keyword" 
reject="7CT 

onReco="checkKWBa r ge i n 0 " > 

<grammar src- http://dena I i/news barge in grammar, xm I /> 
</reco> 
</body> 
</html> 

[0 17 6] 3. 4 Prompt^y^ y $ $ icSt'SBi" h t £ Z.W<> 

^7t°— h LT& *) , -^tfV^ K^te, prompt [0178] fllSl : 

OJMfci LTffS-rS 6. [0 17 9] 

[0177] 3. 4. 1 onBookmark ?£40 IS 9 ] 





<prompt irrriBiXjkmiirk'^'liandla-". 




Obj ect. cmBooi^nsrk - handler 
Object. onBookuiarlc ~ 



[0 18 0] ^-<^ ^y^x £ htttR : [f 10] 

[0181] 
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[0183 

<Ob a r g 
(b a 



49 

13.4,2 onBargein; if 



50 



V\ ) ^.^-^^ K^SrfS^LTts barge 

[0 18 4] 
»H] 





<prontpt onBai^ r handIei J \ . .> 




CkjectoflBatgem "handler 
ObjectoMtBargpin - 



[0 18 6] 



^ m 1 2 j 



[0187 
[0188 



] 3, 4, 3 onComplete:pr 



[0189] ffiX : 
[0 19 0] 

mi 3] 





^prompt oitComplcte^'handIer ,T ,..> 




Object onConpIcte - handler 
Object anGottifrtete - 



[0191 
[0192 



] 4-<^ ht^^x^ hffffR : 



^ [»1 4 J 









prompt l$&&$ETi-% 




n±^7&^m 9 (£35 7 UfctS^KJ: status «0(£$:&i-<£> 
-£ ? statu* 2r JrSe^f&^&frfc J; ? lc 



[0193] h^v^r- -f : -T^V 

[0 19 4] 3. 4. 4 y^^^^^iV-f^h 
JMT^0f? ! Jil> p r om p tfU^t-b a r g e i n J&*3E 



#Si-6fc»f^s ■/y^^-^-r-<>h«r^<7>J: Steffi 

mi~$>fr%:7F-t$$X*foZ> 0 o n B a r g e i n^>- K7 
J)*, ^'p— /<,U f m a r k J ^ffc&r p r orn p t f*ITiI 

i£r e c o0>««L9iBlft ( Th e a r dj ) T?«JBSft 
6„ 



<script><! [CDATA [ 
var mark; 

function interrupt { ) { 

mark « event. srcE I ement* bookmark; 

f 

function ProcessCityConf irmO f 

confirm, stop 0 ; // a y"7 r 7 y i^i~6 a 

if (mark = "mark_or igjnjsity") 
txtBoxOr igin. value = 
event. srcElement. text; 

else 
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txtBoxDest, vafue = 
event, sr cE I emeirt text ; 
} 

]]></scnpt> 
<body> 

<tnput name^'txtBoxOrigin" value-'v'T VA?" 
type="text"/> 

< input name="txtBoxDest" type="text" /> 

<prompt i d~"conf t rm" onBarge i n=" i nter nipt 0 " 
barge i n=' r 0"> 

From <booktnark mar k="mark_or lgin_ci ty" /> 

<value targetEiement="orgin" 
targetAttr ibute-" value" />, 

<bookmark mark="marMesOity" /> jfttfir < n £ Sr^I L 

</prompt> 

<reco onReco="ProcessCityConf i rmO" > 

<grammar src="/grm/t033/cities. xmi' r /> 
</reco> 

</body> 

[0 19 5] 4 DTMF -bind: D TM F ^f&m^^ft V 4 —fr Kfcf I 

hfl^v^^^-^r^^WWSttrttffl+S^^ [0 19 7] It: 

y 7"f^ >-*^>;Mfc-f6C i:^-eg:5 0 OX -targetElement: ^JL M&ft&IBIUS 

MF(*^lMfc$ftfc£:#te;, protnptt^n *3 S T t>H5S* (fc t * 3 C SMIL 

iCbarge-in^^y^TOSt^;^^ 2. 0<D»fi>£IWfc) . 

6o DTMFi^»C^3UT|^TTt^i:-5^^^J:l/-f-< -targetAttribute: KWiKX:^ftl 9 S 
vr^^\ *^^^S5St^i:§Wj»H -|ft SO Xbh&P-tfy h9m<r>M& (fciAtfSMIL 

t^/7^f 2 1 6^fV7t-/<2 1 4 ^co^l 2. 0O^£p3ft) * 77t^F^||i r v a i u 

[0 19 6} 4. 1 =r>^>^ • test :MSrw*#, f7*/H^iD«tttru 

• d tmf gr a mm a r :^V7^ >^MT^§ e e c 
f^l : Srx** M:v^tV^/t^ 
< input type="text" name="ci ty"/> 
<DTMF id^city^ohoice" timeout== w 200(T numD i g i ts=T> 
<dtmfgrammar> 

<key value="1"> v-T </key> 
<key vaiue="2"> ^^h> </key> 
</dtmfgr aroma r> 

<bind targetElement="ci ty" targetAttr ibute-"va lue" 

/> 

</DTMF> 

r c j t y_c h o i c e J ffi1£Mk$tltiib § ^ 
— tPfl* 1 SrJft* t rseatt) ej ^ 2 t 

tiUft<om&&ffihP\ v^x btt^h\ 

10 1 9 8 J 
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(Input type-"text" Bame~"area_code" /) [0 19 9] 

{input type-" text" aaiae="pii0i*ejauiBber" f) 
<DTMF id="areacode" numBigi 15=^3^ 
onReco="ex t ens i on, Ac t i vat e <) '*> 

{bind tafgerelemeIlK'area M c0de ,, /) 

{BTHF id=*extension" ntfmDigii3= w r> 



{bind targe tElemeftt^pfconejtiuiuber" /) jq 
</DTMF> 

<input type="text" name="credit_card_number" /> 
<prompt onBookmark="dtrof . Start () ; speech. Start C) * 
barge tir="(T> 
<bookmark name="start Ing" /> 

fc«i»f-6j&\ fixity K»#*A;fcUT< 
</prorept> 

<DTHF jd="dtmf" escaped" length="16" 
i nter d i g i tT i meout="2G00" 

onkeypress-^speech. Stop () "> 

<bind targetE!ement="credit_card_number" /> 

</DW> 

<reco id="speech" > 

<grammar src='7grm/1033/digits. xm!" /> 
<bind targetEfement="credit_card_nijmber" /> 

</reco> 



[0 2 0 0] 4, 2 m±&£Xfy*vs<7- 4 
4. 2. 1 JRt£ 

• d t m f g r a mm a r : &fl( tt DTMFM^UR 
[0 2 0 1 ] 4. 2. 2 

- DTMF g r a mm a r : fgl&t? /Wih^-h 
DTMF^ffXML DOM Node^^x^h 

{dimfgramioar) 

{key value="CT>0<Aey> 
{key value= ,> l")l<Aey> 



"9">9<Aey> 
"*">*<Aey> 
'TWAey) 



{key value= 
{key value= 
{key value= 
</dtmf grammar > 
♦flush 



5 Ix^^/SiA^COy-^^^^o 7^7 ^-/V 



e o 

• escape 

/"C^CO escape * escape ^ 1 

• n umD i g i t s 

fc#>tf)3r — ^ escapeirlengt 
h ^ ft S H fc*-^ , if Aft dSfd f c £;H 

40 6iDTMFtr>^l*T1"5 0 

• dtmfResul t 

^f^tfcDTMF^)MU^^i®«fflt 
•text 

S -C#Ht £ ixfc b - £ «r»IA b T ^ 6 B£ i& 5? 

• ini t ia ITimeout 



1 cDDTMF ^ — ^ 
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* interdigi tTimeout 

[0 2 0 2] 4 . 3 ^ ^ h ^ y y K : 
4.3.1 Start 

[0 2 0 3) mX : 
Object, Start () ; 

«^ : 

[0 2 0 4] 4. 3. 2 Stop 

t o 2 o 5 } mx i 

Object. Stop {); * 
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[0206} 4, 3. 3 Flush 
DTMF/^77tr77y^tS, F] ushfiD 
tmf -tr y > ^ v^ic^tB-rr £ *s-e# ?fcv\ 

[0 2 0 7J «X : 
Object, Flush () ; 
/<? 3§l *) « : 

#M : 

[0 2 0 8] 4. 4 h 

4. 4. 1 onkeypress 

1f ifi escape ^ — ^rOP < £ , onKeypress 

20 [0 2 0 9] : 
[0210] 

is] 









Object <mki?yprcss * handler 
Obj ccl.ojikeypf QSv<4 - 



[0 2 1 1 } -f-<> ht^x^ hit* : 
[0 2 12] 



[0 2 1 3] h^P/<f^ 
I 0 2 1 4] 4. 4. 2 onReco 



[0215] tUS: : 
[0216] . 
[117] 









Object,(mReco -handler 
ObjecLonReco ^ 



[021 7 } -r-O- 
[0218} 



iJr[I18] 









^— escape — ^-tf-f &Jt&^—^ h a—<? 







[021 9] -f-O b^pX^ : -Y Y/^ls K^te 

gn-rs^-o h^v?*^ htss^-rs - tds-r-*^ 



[0 2 2 0] 4 , 4. 3 onTimeout 

50 JBttsnSo w^-f-^^ hfiBBt^o-fe^fcgfttttzff 
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58 






* [0 2 2 2] 


[0 2 2 1]!^: 




* [$19] 






<DTMF onlfmcout^'liaiKiter'' 






Cl>jcc£.oiiI1mooul — handler 




(ECMAScript 


ObjectoiiThceoul — 









10 2 2 3] ^^b^v*^ 
|0 2 2 4] 



[f20] 



✓*yvu- 






— ^ &f&m$frftv> 







[0 2 2 5] J^lsYy&^J ^^yh^VK?^ 
[0226] 5 Callcontrol ±~? 9 Y 

ft * fc ft sag ft v \ 

[ 0 2 2 7] .Z.<D7t?i?=.& h^LTfl 

<7> 3 *# tf> J*M » 69 -f 6 Wtt /'c t 1 4 ^ -To 

[ 0 2 2 8] 5. 1 /o^7^ 
'address 

WR9»ja. XML DOMZ-Kt^x^K mm 

N ANItALi^tTfcoTM V 

o I P<?>»d\ rix#Si¥»<D I P7 Kt-^T^, * 
r ingsBeforeAnswer 4f ffW t£ lt^~t 7 o 

m <D^m L.*oEHfc. f7^^ h»**Bh?*>t> „ ctt 

[0229] 5. 2 > y 2/ K 40 

[0230) 5. 2. 1 Transfer 

i/xf Alii- SI ^tM)TOi:^7 >i - v-xf a !J 
y *r»T1" 6 r; t s&st** 6 0 

HWfc : 

telephone. Transfer CstrTex 
t) ; 



• s t r T e x t : @T»!<a§fflf*<£>T K^ B 
ftL 

fix W*UB+<fciM\ - 5 lt#f«7 7 y £ 

* * fc f i@^#®tg ft Jfc^ L ft V \ 
[0231] 5. 2. 2 Bridge 

& s t r U I D£'MbTM£tL& t -fey^g 

SWi:ft5«8S^7^ h'7d— Afi, IoT<^lf^5E 
AtWftS^^lPteifftl-^cirds-c***, *Ht. ft 

[0 2 3 2] »3: : 

telephone. Bridge (strText, 
s t rUI D, [imaxTime] ) ; 

• s t r T e x t : <&§C C 3f»3^S«#^T K^ 0 

* s t r U I D : <&M<, ^^**r— *KJ*BlJ1- <5 ir y 
v-a^I Do *5**iXfc#d*K**l,Si#[C, srtU 
I D^a d d r e s s JRf*fctC**Sjh.S 0 
■ i m a x T i m e : ftScil^ i|E*S*Vft:Pf O^ffi 

(&**i*Ri#IH« Jt*S*bftv**fi\ ^^^/wh^ffiti 
ftL 

[0233] 5. 2. 3 Answer 
fllX : 

telephone. Answer () 
R 9 AS : 
ftL 

onAnswer-f^^ httlBttSixftt \ 
[0234] 5. 2. 4 Hangup 
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«• 

^S: : 

telephone. .Hangup () 

mvm. : 

*L 
«^ : 

[0 2 3 5] 5. 2. 

telephone. Connect (strTex 
t , [iTimeou t) ) ; 

■ s t r T e x t : &m» M©f:ff^7 KU* e 
* i T i m e o u t : fttltt Wfr£*ihi-5 J*T<0 

[0 2 3 6] 5. 2. 6 Record 

«S: : 

telephone. Record (url, endS 
ilence, [maxTimeout], [init 
ialTimeout]} ; 

<HTfC> 

<HEA0> 

<TfTLE>Logon Page</T t TLE> 
</HEAD> 
<SCR!PT> 
var focus: 

function RunSpeechO { 

if (logon, user, va lue = 4 



60 



- u r I : &m* E»*ftfc^*^u r l 
■endSilence: «Wi&Mfcffl Sftfcftfc: 

•maxTimeout : tttSR. IBft-f 5 
'initialTimeout: ^^51tR 0 Sa®<©*& 

[0 2 3 7] 5. 3 -Y^V K7 
[0238] 5, 3. 1 onlncoming () 

[0239] 5. 3. 2 onAnswer 0 
[0240] 5. 3. 3 onHangup {} 
Mi, ^Oi/^A^H a n g u p*fcf*T r a n s f e 

[ 0 2 4 1 ] 5 - 4 0!) 



focus-"user"; 

p__uid. Start 0 ; g_ log in. Start 0 : dtmf. Start () ; 

return: 



if (logon, pass, value = ( 
focus~"pin"; 
P_pin. Start 0 ; gj og in. Start 0 ; dtmf. Start () ; 

return: 



p_thank. Start () ; logon, submit 0 : 
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function login_reco{) { 

res = event. srcEIement. recoResult; 
pNode = res, se I ectS i ng I eNode C//u i <H : 
if (pNode != null) 

logon, user, value = pNode, xm I ; 
pNode - r es. se f ectS i rig I eNode {"//password") ; 
if (pNode I- null) 

logon, pass, value = pNode, xml ; 

} 

function dtmOecoO { 

res = event* srcElement. dtmf Result; 
if (focus ss "user") 

logon, user. value = res; 

else 

logon, pin. value » res; 

} 

</SCRIPT> 

<SGR1PT for="cal IControl" event~''on I ncom i ng"> 
<!— 

cal IControl. Answer () ; 

// 

</SCRIPT> 

<SCRIPT for^oallContror event~"onOffhook"> 
<!— 

pjna in. Start 0 ; gj og in. Start 0 ; dtmf. Start 0; 
focus-"user"; 

// 

</SCRIPT> 

<SCR I PT for-"wirxlow" event="onload"> 
<!— 

if (logon, user, value !=*"*) [ 
P_retry. Start 0 ; 
logon, user. value = 
logon, pass, value = \ 
checkFi eldsO ; 

} 

// 

</SCRIPT> 
<B0DY> 
<reco id-"g_Jogm" 

onReco="login_reco() : runSpeechO" 
timeout ="5000" 

onTimeout="pjniss. Start 0 ; RunSpeechQ > 
<grammar 

s r c=http : /Aokanee 1 /etr adedemu/ speechon 1 y / 1 og i n. xm I /> 
</ reco > 
<dtmf id^dtmf" 
escape- 

onkeyt>ress="g_login. Stop 0 
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onReco~"dtmf_reco 0 :RunSpeech 0 " 
i nter d i g i tT i meoiit~"5Q00" 

onTi!ueout="dtmf. FfushO ; p„mf ss. StartO ; RunSpeech 0 " /> 

<prompt ld="pjna\n"> zi—lf I D*5 £X$ mm < 1*$ \,\ </pro 
mpt> 

<prompt rd~"p_uid"> if I DKttflfflB UT < l\> </prompt> 
<prompt id="p_pin"> tTV^tfWttl LT < K S V\ </prompt> 
<profcpt id="p„miss"> * LiRfc 9 **A/. L Lfc, </prompt> 
<prompt id="p_thank"> fe^s»^^U: 0 *55 
fc£V\ </prompt> 

<prompt id~"p,retry"> * vmh Y> £itAv* ^-f I D^i:^^yf 
L£i*rA/ 0 </prompt> 

<H2>L6gm</H2> 
<form id~"logon"> 

HID: <input narae~"user" type="text" 
onChange=''run$peech()" /> 
Plff: < input name="pass" type="password" 
onChange~"Run$peech () " /> 

</form> 
</B0DY> 
</HlWL> 

[0 2 4 2] 6 ^7P^7D-OjBS ^7P^7o^t*llit5*i6t:Stt©T?fc6, ^ 

6 . I 7 p ^7 p « SrUSt 5 fc^OH TM L *5 /^/oV/h^syf >^illrt$fc*iCHTM 

■fct*** y/hu?«fflr^«it A^3#«^^(aM LAM* (hooltipj »«£ lti/f7'7 

ffi£r-&U A*ffl^3:««#^^^*raWt1"S, **** ft«ffi4h5) T\ titl e*tt*r«JS1-5. 

<html> 

<title>Context Sensitive Help</titje> 
<head> 
<script> var focus; 

function RunSpeechO { 

if {trade, stock, value == "**) { 
focus-"trade> stock"; 
p^stock. Start 0 ; 
return; 

I 

if (trade, op, va I Lie = ( 
focus-"trade. op"; 
p_op. Start () i 
return; 

] 

trade, submit () : 

. . J 

function handleO ( 

res = event. srcElement. recoResu 1 1 ; 
if (res. text = "help") ( 
text = "tSlUT < 
text += document. a H [focus]. title; 
p„he!p. Start (text) ; 
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} else { 

} 

} 

</script> 
</head> 
<body> 

<prompt id="pjieip" onComplete="checkFi ledsO" /> 

<prompt id~"p_stock" onComplate='g„stoGk. Start ()"> t^^OTlt 

<fcT£v\, </proropt> 

<prompt id="p_op" onComp I ete="g„op. Start 0 "> JK9t*1"*^ W^Vf"** 

o </prompt> 

<prompt id="p_quantity" onComplete^g^quantity. Start 0 W > WWk^-tfr 
0 </pr ompt> 

<prompt id=' r p_price" onGomplete~"g_pr ice. Start {)"> WfetevK £>T?i~ 
;o* 0 </prompt> 

< r eco i g^stock" onReco="hand ! e 0 : checkF i e I ds 0 " > 

<grami??ar src=". /g_stock. ml" /> 
</ reco > 

<reco id="g_op" onReco="hand I e C) ; checkF i el ds 0 * /> 

<gra*nmar src=". /g_op. xml" /> 
</ reco > 

<reco id="&_quantity" onReco^handieO : checkFieidsO" /> 

<grammar src=". /g^quant. xml" /> 
</ reco > 

<reco id^'g^price" onReco="handle() ; checkFieldsO " /> 

<grammar src=". /g„quarrt xmT /> 
</reco> 

<form fd="trade"> 

< input name="stock" title^'Vtock name" /> 
<select name="op" tit!e="buy or set i"> 

<option value="buy" /> 

<optlon value="$eH" /> 

</se I ect> 

< input name="quantity" titles" number of shares" /> 

< input name-"price" t i tle~"pr ice" /> 
</form> 
</body> 
</html> 
6.2 SMI LC0f£iE 

EAT*>«|f*, SMI Lftl^MLfcp rompt^rec oSS&Ofgft 

<html xmlnsit^'urn^chemas-microsoft-coin^ime" 

xml ns : sp^'urn : schemas-m i crosoft-cotn : speech"> 

<head> 
<styfe> 

.time (behavior: ur i (#default#time2) ; } 
</sty!e> 
</head> 
<body> 
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< input nafne="txtBoxOr igin" type="text"/> 
< input name~"txtBoxDest" type="text" /> 
<sp:prompl ciass="time" t:begin="G"> 

</sp:prompt> 

<t:par t:begin~"time. end" t:repeatCount=" indefinitely' 
<sp:reco cfass-"time" > 
<grammar src=". /city, xml" /> 
<b i nd tar getE I eraent=*txtBoxOr i g i n" 
va I ue='7/or I g J n„c t ty" /> 
<b i nd targetEl ement="txtBoxDest" 
test^Vsmi/dest_city[6conf idence $gt$ 40]" 
va!ue=V/dest_city" /> 
</sp: reco> 
</t:par> 
</body> 
</html> 
SMEX (Message) 



#12 0 0 3' 
5£ 



13 17 7 2 



[0 2 4 3] 7 

y ^ & h 

Simple Messaging EXchange 20 
/EXtens i on^Wtfc^SMEXfi ?7 
JTls hf'W *<D-?7 y h? * — J*±^fo&mWM 

i?&<5 0 ^«<smex>S:IifcSiat XM 

[0 2 4 4 ) ^>*£^;Mfc1"S£, isx.fi Mix 

✓ ■^■-f lvalue). I^HlC s 

gltStofc^ yir— v^SrftJti-^s XML DOM N 
o d e ^<-f ^(n^u/^ 4 fe^ff^. messaged 
^^ML 7*7 y h 7*-2*/ v^Sttft* 



[0245] message ^fcfi s m e x Tt^iSA £ 
h*7yn-KSii5t8SISW. 0<<a»£\ sm 

e y\%&m$ v-^t yy'-tmi u 

[0 2 4 6] smexty^n^ ^ir-v^^ 

14, (S I P*fcttCCXML"C*ffl4*irv^-5J: 9 
ft) yt^7^— 7y h^l<«*$ 

sfcT-^^^JL 7"? y h7*~-J*tT7"V tr—*s 

— & T y 7><DMm<t $ *tfctt« Pffttt*: T* * 

[0 2 4 7] 



<smex id="logServer"> 

<param name="d: server" xm I ns:d=''urn Microsoft. com/C0M"> 
<d : p r o t ooo 1 >DG0!K/d : protocol 

<d cisi d>2093093029302029320942O98432O98</d : c 1 s i d> 
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70 



w m 



<d: i id>0903$59304903498530985309094803<:/d: iid> 
</param> 
</smex> 
< [ i sten . . , > 

...// A*7-{^K©r e e o*S*Sr*6|*1"^ffliO»^ 
<bind targetElement="logServer" targetAttribute~"sent 
value»"*[@log $ge$ 3]V> 

</l isten> 

[02 48] r<©W4s ^y-^i d*3«fclM>*-- 7* 

riogj feBKti-5SML^-KtcWltjW*.« 0 ±15 
T^!?4r— i/»>'W3B*f4^ Ob i n d 
«»* (directive) SMfefflt" £ r t tC J: o 

77 tr y^^^9!>ifoiiBf#-e«fflT?#a 0 

[0 2 4 9] ClOWtis smext^xnr«F* 

< input type="text" id=" remote"/^ 
< input type="text" id~"transfer'7> 
< input type="text" id="iocar7> 
< input type="hldden" id="sessiorUd'7> 



[0 2 5 01 



<smex i d="le 1 ephone" sent="start_ i 1 sten i ng"> 

<param name="server">http : //te I ~sv r /what eve r </par am> 
<bind targetElement="session_id" value="//sid'7> 
<bmd targetElement="remote" vafue=V/rernote„addr'7> 
<b t nd tar getE I ement="t r arssf er " 

va I ua= V/transf er„addr'7> 

<bmd targetEiement=" local" value='7/local_addr'7> 



</$mex> 

[0 2 5 1 ] roWi, ^itStofc^ y-fe— v^^r^iii- 

addr. transfer addr.&^tFloc 4# 

a 1 __a d d r ^^T1"^ £ itlt 

I0 2 5 2] :oiT(t ffiSif— /<bmfB-f-%>fcib 

T/D > ^ y^iM $ tifc £ # J: o r#i 

rmi rcD^J"Cft, if— r s t a r t_ 
I i s t e n i n gj ^ -yte — i/ & mini' Z> Z.tKXo 



WCttfli^tSj&s. Web^!:IiU, 77!)^- 
d&m.V7<<~- /l/M^$^ e r e c otf)i§^£p] 

[0 2 5 3] 7. 1 7o^f^ 

-sent: ^^/liAW^t, ~7>y y F7^- 

t^is« $ n 5 ^ y ir - 5 3C-f»?iJ<, 

s e n t *i 1 v a i u e £ LTSiffl §^^®#fl^-f , 
^•(O^yry^^fV^-yf (dispatch) & 
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10 



/n/<f^ rvalue^ LT^ffl£:h,£3# 
-fi\ ^fctt^^yc^T^^n u i ! h 

•received: 0 S£ffl fc ^ ^rit^ofc 
^yir-^^fXML DOM Noder^. ^ 
S^H n R e c e i v e^^^ill 

«j&S-C#$*T?\ rva luefcfCftfflnnBiftS. 

-timer: KR0/*»*T?fcoT, ^^AT!? h 

oSRo ^n'<TV tdjEOM^J^STipn^ir. ^ p ^ 
AT!) h-f h h y ^iTf^ p s/^&ffJbS* 
-status: a^fl^oT, 

<y-£ — receive d/P/'f^ 

Mi<at«SHfc»«, s t a t u s KfetOT?* 

[0 2 5 4] 7. 2 -<-<>' h 

<smex id="logServer" , , . > , . . </smex> 
<script> function logMesseagedogClass, message) 
logServer, sent - logClass + *|* + message; 
} </script> 

[ 0 2 5 7 ] Hl^icJl IM*****^^*"! 

rsfc^fca* ±»o«-et4, P^i/Hlftil i d 

<smex id="!og$erver" onload="addFunction () "> 
<scr ipt> 

function myJogfoessageOogClass, message) { 

logServer. sent = logGlass + + message 
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**onReceive: r©^^y Ml ^ h 7 ?*• 
— At* y *-^dSSJ3iU*:i*Ki3S«SixS. bind 
gSIU: J: oTlf $ ixfcft Ay 6 ^Of^fofc^ $> 

fiS S tt^o i&fs^Mt-. received^P ^ *S 
• o nE r r o r ; £ <D<1 <<ls V fl ^ 4 AT # h^jft 
aHf£*t£ 0 ^^^-O-h^jgS^S^, statu 

KM**!,*, 

[ 0 2 5 S ] 7 . 3 



20 



-bind: gritmofe:^ ^±-e**3ti*ttff U 

T^£*§<&§rl&#, r e c o<Z>*fr£WMIt. 
*param:reco OiH-ra <b SHII}C > s m e x 

M^^y h^d~ A#^^7*~^&*#M~ 
6 0 *P a r a mlggffclH f n a m e J JRtt^rflSJB U 

^cDieTfc^, — stis»iin?tt, sit 

10 2 5 6] 7. 4 #W&?R 



^ [0 2 5 8] ^Dw^»ftft»Stt^A^»6\ EC 
MA Script^ f p r g t o t y p e j "fa 4 
SrffifflUT, My-; Ki LTB8*SrW*+*P 



</smex> 



function addFunctionO ( 

logServer. prototype, iogMessage = 
my_iogMessage; 



</scr ipt> 

[ 0 2 5 9 } £ v ^ KJBrtio^^iSfer-oBBS; 

iogServer. lo^essage(KEC0_lOG_ERR0R, "My 
aessage") ; 

[0 2 6 0] t^tfl)««i«ittfei:«»-c»4S 
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[IS 2] fl 1 Oa^tfa-f ^ y^f /^^©i«Bt 
[El 3] tB®¥IHtfc5» 

[B4] *^**bt*>*. 

^irft^ilftftB - ?*)*. 

[H7] ?7 4T^hiM^mn^mt£^-?Tyymi& to 

[HIS] l^fcWf^Wi^WtSs ^7^TV 

— * j^iia -e fc 6 * 20 
f B-cfcfi. 



IB l 4 J tMm— /«om l coS)f)^^~ KSr*i"JMBB 
[B15] fSBEf— ^co*2 Og*#^~ K*r»f 
[Hi 6 ] WBr9—'«o%i 3 ^fflMt^E" K*r»i"lfcBB 

S 0 Jfttt&ttB 
2 0 2 We b1*— 
2 0 4 =ff J^U"—^ 

2 0 5 -fy^-^yh 

2 0 8 PSTN 

2 10 $sH#V o I P 

2 1 1 *J*ftifG>R e c o 

2 1 2 WS*^^^^ 

214 ^fr/f-^ (y^-HpTW^^*^ h) 

2 16 I E^7^ 

3 0 7 TTS 

309 mxnwrmm 

320 as p+-<— ^pi^mmm/Tisyi-- v 
324 ymm 



IBU 



35a 



35b 




35c 
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103] 








37 




— 


1 'I 





59 







D/A 





30 



34 



5*4 xvu-rmm 




60 





EH 4 J 
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(15] 



200 



324 





















, » 








320 202 
Webfl— /t 



/ 



Web 



205 





1 ^oVJ 


^ ». 


1 RAO I 





30 



aim 



SAP! 






Reco 











212 



211 



tit: 



307 



309 



204 



[He] 
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<html> 

<form lds # yot_c»rd^lnfo* method* "post" action « 'http :/ /pay** *nt + a*p-> 
<seiect nfime-^card^type** 

<o P tltJft v a Jue-*»m*x*> American £xpre*«</optkm> 
< option vslue-'visa^VI** </opfclon> 

</se(ect> 

< Input type™ 't^xt* name*^tt*r<Cnum" wtdth^'30 11 

<tnput type^*toxf ri3me~ p BXpiry_d»t*" /> 

fcype~**ut>irt1te" vahie» "Subm ft" onclick^ "v^HfyO" 

-c/forrn > 
<sCrlptxf[COATA[ 

function handle O i 

if (get_card^info.caxd_j:ype* value — "amex") [ 
if {getJcard_in£o , card man, length*" 1 st 15) 
262 alert {"arsex should have 15 digits"); 

V ) el^e 

if tget_card_info,o^rd_nu^ T length l«- 16) 

alert ("visa and flitter should have 16 digits 1 *); 

} 

function verify I ) { 

var flag «■ window. confirm r'aubfcit £he credit card info*? ) / 
it. (flag) 

^€t_cas:d in^Q - submit () ; 

J]* 
</scrtpt> 

[Hi 0] 

function _handleO { 
handleQ; 
checkF-fnedQ; 

function checkFilledQ { 

if Cca rd_ type, value = { 
p„carcL.type,acriveO ; 
9_ca rd^.ty pes . acti vat e O i ret u rn ; 

} 

if CcarcLnum. value v***"*) { 

p^cardLnunuacti veO ; g_card_nu«,acti vateC); return; 

> 

■if C^xpi ry^date . value «■ { , 

p_exp*ry fc data.acTweO; g^expi ry_date. activate Q ; return, 

p^content^acfi vate() ; 
p^confi rm.activateO ; 
conf -t rotation . activareO ; 

} 

function conf i rmedCgob j) { 

if Cgobj.recogfces.text «~ .yes ) 
get^cartLinfo.subimtO ; 

} 

// 

// user codes start here 

function bandleQ { 

if Cget_card_iftfo.card^rype «** "amex ,, :> { 

if (get_card_i nfo,card_num. length \ - 15; t 

prompt. speak O'amex should have 15 digits ); 
get_card„Jnfo.card_nLjm « 1 1 ; 

* & if e (g€t_card_info.card_num length i= 16) { _ 

prompt. speak ('visa and master should have 15 digits J>, 
get_card_info.card_nunt ™ r ; 

</scn pt> 
</body> 
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«rform \d * rc^inf*" method-*p»st" act io n***hfctps// pay ment.asp'> 

<selact name^ca retype" onclick* nalkC$_ca retypes )"> 

^option value- "am ex"> American £xpress</option> 
< option valtre = "vf«a"> Vlsa</option> 
< option V3rue*="m**> MasterCard < /options 
</seiect> 

<reeo td~"g„ca retypes* onReco »"h8»cfto() * > 

< grammar srt s "./gram #card_ty pes* /» 

</reca > ' 

<fnput type=**«ext* Jiame = *c3rd_num* wldth^'so* 

<reco id = B g^cBrd^num * onReco - 'hunrfleO ' > 

<gi*3mmar src »" Vgram JFdiglts " f> _____ 
</f€IC0 > ^ 

< Input type xt " text" name= "explry_date' *~ ~ 

ortC»ck=*talk{g_«xpiry_if»t«r /> 

<reco Fd~ ,, 9_axpiry_date H > 

< Orammar src=" ./gram«datet " ^ 
</reco > 

<toput feype= "submit* ya lue^ "Subm If onclick* "v^rifyO* / > 

<script=-<!CCDATA[ 

function t3lSc(gob^) f 
qvb 3 , activate ( ) ? 

> 

function handle <) ( 

if {get_card_inf o- card__num f = null) { 

if rget_card_inf&.card_type. value ^= H aKBX") 
if (get_card_in£Q,card_jauro. length !»- 15) 
alert {"sm&x should have 15 digits"); 
} «lse 

if {get^card^inco, car d_tmm. length f- 16) 
alert ("visa and master should have 16 digits*); 
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function verify (> { 

v^ar fl*g * window, con f irm t "submit th<* credit card 
Info?") r 

Ulag) { 

gec_c^j:d__info. su&rait {J ; 



if 
} 



> 



</scrlpc> 
</htm r> 
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< t»0£*Y > 

< form *d«* pet„c»rtf.Jnfo * method *>'p$**\* sctlon http://p*ymont»»«p • 
onActivirte wfeicoi»C> "> 

< prompt Id «" p_-welcoro* n > Wc.tiow need your credit card <f prompt > 

< prompt ki a * p_m«mbb* X dlctn't understand foil prompt > 

< prompt td „.««*„fcVI>» * b^rpfcln «* tru# *> What credit card would you 
</ prompt> 

< prompt Id « H p^carCnum * fcarpeio = * true "> Pt**t* **y the nurofcer* </ prompt > 

< prompt id p_explry_d»te ■ D^roelft -"t«r«*> Whit f* tt>» e*plr»tten 
date? <y prompt > 

~< prompt Id »* p_cont«nfc "> 

Ihflvcyonr < vaJue select cnrd^type ' /> <s volue sstect c*rd_r*u*if* * /> 
with expiration date 

< value select-* expjry^jtet* " /> 
</ prompt > 

< prompt id = * p_conflri*i *> ix this correct? </prompt> 

< reco id-*" B_card_typcs " onHoReeo mum ble(thtsv 2) ■> 

< grammar src »- * Vera w#«xard„typ** " 

< /reco > 

■c mtif Id ~ * g_o*r4„.«uw* " onMolteco - » m u m b l»( th it, I ) > 

anReco *' _hanttktCth trard^noro) * /> 

< grammar src^* ./» ram tf clip U* " /> 

</reco > * 
— < reco g m *xptry_<*at« " otiNaReco mumbMtbte/*} > 

onfteco _b*rt«M*{tbi*, expiry_dste) ** a»tflo = H rao * /> 

< oi-amma? src*-* ,/gra rr**d«t#* " /> 

t /reco > 

<rtco id^ x confirmation - ofiReco CP n firmed (this > * /> 

< grammar src «* " m#y*«»o ■ /> 
</reco > 

< setect n*mer *" card„_type 

< option va?uc *"nriw^ American Express </ option ?- 

< option value «"vlsa V**a </ option > 

-c option vaio« - M ms*> MaoterCardi </ options 

< / select > 

< Input type -* text" name c#rd„|f*um " width * * 30 * f> 

< Input type text" name - " »xpiry_ * t> 

< Input type - - submit H *&t*/« -* Submit * /> 
</ f&rm > 

* script ><t[COATAf 

r-uiicrlon welcome { ) { 

repeat: » 



cbes);FUlfl(J(} ; 

got,) .doactivat^ U ; 
p ULUtibXfe ■ Active ( ) i 

checkPil led 1) ; 
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<body> 

<form Jd^ w set_card_jr1fo ,, method- "post* actfor»=r w http://paynient-asp- 
onactivate- "welcome() , '> 

<prompt kJ=^p._welcome*>We now need yotrr credit card</prompt> 

< prompt id^p^mumble^I didn't understand you</prompt> 
<prornpt id=="p_card_type" bargein^true^Whai: credit card would you 

U£e?</prornpfc> 

<prompt id~*p_card_num w bargein="true">Please say the mimber</prompt> 
<prompt id~ M p_expfry_date* bargein^true^What is the expiration 
date?</prompt> 

< prompt id^p^conterrt^ 
I have your <vaiue setect-Vard.jtype" /xvafue setect-*card_nutiT f> 
with expiration date <va!ue seiect~ 1, 'expjryi.date'' /> 
</prompt> 

<prompt *p_conf1rm* >Is this correct? </prompt> 

<reco id^ H g_aet_card_info" onReco^handleQ" onNoReco ="roumbleCthter > 
grammar src s\/g rani /get Payment" /> 
<blnd target ^"card^type* vafua =V card/ type" /> 

<bind target- n card_nurn" valuG=Vcard/numher rt /> 

^ Ui) <bind target^^xpiry^date'* value ^Vcaird/exp^_date ,, /> 

</reco> 

<reco ^"confirmation* onReco ~ M confirmed(this) onNoReco- w murobIe(th!s3 ,, /> 

< grammar src «*,/gram#vcsrroV> 

select name - *ca rd_type m > 
<optlon vafue="a me:** > American Express</optlort> 
< option vafue- , Vfsa'> Visa </opt Jon > 
<option value^'ms"^ Mastercard </optlon^ 
</select> 

< input type*«**text" na m e = "ca rd /mm n width= H 30" /> 
<rnput type^'text" na rne ~ "expiry^date 11 /> 
<input type«*subm1t" va1ue«"5ubmit n /> 
</form> 

< script > < J [CDATAf 

on welcome Q 
p_wel come . acti ve O ; 
repeat * 0; 
checfcPilledO; 

function mumble CgobjD { 
gobj , deactivate O : 
p^mumble * active Q ; 
checkFilledO; 

function _handleC) { 
handle Q ; 
checkFinedO; 



> 



T 
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function checkFilledQ I 
if CcarcUtype* value — TBM ) { 

p_car<L.type* active O ; : dcufl eld. activate O ; return; 

if (cartLjnuzn. value { 

p„carcf_num*activeO? do_fie1d>activateO ; return; 

if {expiry_date, value 

p_expiry_data.aetiveO; do_fie?d.aceivateO; return; 

PLXonten t . acti vate C) ; 
p_conf 1 rm * ac ti vate Q ; 
conf i rmati on . acti vate O ; 

function confirmed (9003) { 

if (gobj * recogRes . text — "yes") 

Qet™carcLinfo.subfnitC9ens«i1 Q); 

J/ 

// user codes srart hera 
function handleO { 

if ffield =s get_card_info»card_nu?B") { 

if C^t^carcLinfo.caroLnum. length f» 15) { 

prompt. speak Caniex should have 15 digits 1 '}; 
get^card^info. card^num ~ "" ; 

> else 

if (get„c a rd„info.card_ntim. length ItT) { 

prompt v speak C*Visa should nave 15 digits"]) ; 
get_card_info.car£Unurts « 

function gensmlQ { 

str * '<sm1xcreditL~card type^"'; 
str card^type > val ue ; str += r "xnuniber>* ; 
str car d_numbe revalue; str '</rtumberxexpi re>* ; 
str +0* expiry^date.value; str 
* </expi rex/c red i t_cardx/sml> 1 ; 

return str; 

})> 
</script> 
</body> 
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SAMPLE ASP+ PAGE 

<%S Page language*** Jscript" AutoEventWireup" "false* Xnherics>» "Credit . Transaction" %> 
<h fcml Xhis a d > 

<!— ASPX page for both voice-only & multimodal credit card example — > 
<script> 

function handle O < 

££ {£i«ld get^card^irxCo . eardrum} $ 

if {get_cara_Info-caxd__iiuiis. length != 153 { 
pronipt . speak ("amer should have 15 digits"}; 
get cttrd_info.catd._num - wtr ; 

\ 

} else 

if (get_c4rd_info,card_a van- length !» 16) i 
prompt . speak ( "visa should have 15 digi ts") t 
g&t ca rd_inf o „ eardjrrom * '* " ; 

} 

function g-ensanl () { 

9tx » r <srta><credit_card><cajcd__tvpe>* j 

str +** card^type rvalue; atr 4- '</cArd_type><JiUTOber> , f 

str card^aumbdr . value ; str +** * <c/mmvber><expir©> * ; 

str +« expiry^date, value; str »</exp±re></credit_eardx/sml> ' ; 

return str; 

\ 

< /script > 

<3cxipt rujnat- t ^rv«r*> 
function Pago Load (obj, args) f 
if [PoiTtBac*;, { 

validator - new System Speech. SWLVaiidft tor , /Credit SPfc. jcml*) j 
dsxnl validator. Evaluate (args ) ? 
Navigate {ChooaeS?age idsral) ) ; 
} else { 

// initial ize fielda with args 

} 

} 

</scrxpt> 

</head> 

<fcody> 

<spcech;fom id~"get__card^in£ o" »tyle-« H »V3te5i_i^itiative T * 
prompb«*" , /prompt /get Payment" opsubmit^gerisml 0 "> 
<speech; choice n<tme^"caxd„type rT prompt ="Wnat credit card vould you user* 
/grajnfrcard type*" onShxas*? ini^h-^handle 0 * 
<opt ioa>;jVm©rican 3£jKpr^s$</option> 
<opfciojt> Vi sa</ op t ion> 
<CQption>^astfercard </option> 
</spe«ch: choic^f 

<speecfi:textbox najne«="card_JiujTtber" prompt**" Please say the number" 

gramma r w *' ./grain* digits" o^Ph rase Fin is ^"handle f ) "> 
Opeech: text bow naroe^expiry^dafce " prompt^'What is the oxpiratioa date? H 

grammar*-" . /graia* dates" onPhra^eFinish^"handle { J "/> 

</#pee ch : fonn> 

</hody> 

</html> 
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<htral> 
<bcdy> 

<?— the data section ~~ > 
350 <form id« / '^et_drinfc /y > 

<input name- "drink" /> 
< input type** "radio" rawe«'*cream'V> 
<input types* "radio" name- / 'svigajc ,r /> 
<input najne="uid" typ&'="hidden' f /> 
<Yform> 
<!!-- The speech section — > 
352 cprowpt id*"welcome^>welcone, caller* <^pxanipt> 

V <prtanpt id-^ask''^ Do you want cob, coffee, or orange juice? 
</prompt> 

<prompt id~"conxTirjn"> I heard <value href ^"drink"/> - Is this 

correct? </prampt> 
<prompt id^"thanks*> Than* you. Please wait when I get it for 

you. <7proropt> 
<pr-oir;pt id="retry"> Ofcay, Jet'* do this again </prompt> 
<pj^DKipt ld**"j,*p*-0Mpt"> Scj:±:y, I missed t;h»t* </prompt> 
<proi7ipt id»"cream_sog3r">Do you want cream or so gar with your 
cof f ec ? < /pxoanpt > 

<reco id ^"reco__dr in fc-xgramnar srt:=". /drinktypeE"/> 
364 — ■ <bind test="/i§confirience IG]" 

target El ement-^r&proKipt" tArget*fethod~"start" 
targBt£lemenfc-*"as 3c' taxgetitfethod**" start" 
targetSI Ewnent="re co^drin fc" t arg^tMethod^ "s ta re " /> 
<bind tas t-"/ drink /cof fee I (3 confidence SgtS 10]" 
targetElfement^dxiaic"' vraiue^ "/drink" 
targetEi.entent^creaiit^sugar'' targe tMethod^" start" 
t a rg e t E 1 enient** *r eco_c r eara__s\igar " 
tQrgetHethod~"itart'V> 
<bind test**"/ [(^confidence $gtS 10}" 

targetElGaaeot^'drLrik" valuer tf /dr ink" 
targetEIeroent="co:nfir*n" taraetffethod**'" start" 
^argetEiement-^recc^yesno" targetMechod=".start V> 

</reco> 
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<r<*co id«"rec© cresre_sugar"><gr3JT3na.x src*". /creara+stagarV> 
<bind tesB^VC© confidence $gt$ 10 a^d 

hostO/g&t drink/drinlc = * coffee T' 
3 56 targetBlemSnt^cream* target Attxib«te="cheD>ced" 

values'/cream/ Rvalue" 
targetElemBRt^"sugax" t3rget^ttribute-"chfccked'' 

valuer"/ sugar/Gvalue"* 
targe 1 21 emejxt-'* con f i rrc w t a rget M6<fchod-"st a r t " 

c/r$ce> 

<reco id-^rfcco^yesno*^ <gran*nar szc-". /yesno"/> 

<binci test-"/yesi§confidencC! $^t$ 10}" 

- targetEKsment**'" thanks" targetMethod~"stairt" 

358 targetMsthod-' r sub7tiit" /> 

V <bind test='7uo or > / [8cott£ idei>ce $le$ 103" 

— - taxgetE:iement«''retry" targetKethod-'-^tart" 

. targetEleinent^ask" taxgetttetbod-"start" 

T;&rg*ttf*thod-''startV> 
</reco> 

O-call control section --> 

<smex id-" telephone" sent«":tt.arr_li5tening w ><para» 
sarver="contaproc"> .„ </paxam> 

<bi.nd targetEl«ment«"uid w value*" /@uicf*7> 
36 0 <bind test^"/C&ll__connected" 

V 3 61"—- target^l^ment-'V Become" targetMethod-* Start" 

ta^getEleraent^asr' t argetMethod-"s tart ~ 

3 62 ' t:argetEleiriGnt»*'reco_drinJ:'* 
3 63 sargetMethod- "starts 

</$Kiex> 
</body> 
</htrnl> 
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1 . Ti tie of the Invention 

IxMsfGUAGE HXTEKSJONS FOR W&B ENABLED 
RECOGNITION 

2, Claims 

1. A computer readable medium having a markup language for 
execution on a client device in a client /server system, the 
markup language comprising an instruction indicating a 
grammar to associate with input entered through the client 
device* 

2. The computer readable medium of claim 1 wherein the 
instruction indicating a grammar provides a reference to a 
location of the grammar, 

3. The computer readable medium of claim 1 wherein the 
grammar is for one of speech recognition, handwriting 
recognition, gesture recognition and visual recognition* 

4. The computer readable medium of claim 3 wherein the 
instruction indicating a grammar includes a reference to a 
grammar for speech recognition. 

5. The computer readable medium of claim 1 wherein the 
markup language includes an instruction for receiving a 
recognition result pertaining to racogni2ed speech and 
associating the result with a data field on the client 
device* 

6. The computer readable medium of claim 8 wherein the 
instruction for receiving a recognition result associates the 
recognition result to a plurality of data fields. 
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7. The computer readable medium of claim 1 wherein the 
markup language includes an instruction to audibly render an 
indication when the speech was not recognized. 

S, The computer readable medium of claim 1 wherein the 
input comprises a text string. 

5. The computer readable medium of claim 8 wherein the 
markup language includes an instruction for receiving the 
text string from another component. 

10. The computer readable medium of claim 9 wherein markup 
language associates the grammar to process the text string. 

11, A computer readable medium including instructions 
readable by a computer which, when implemented, cause the 
computer to handle information by performing steps 
comprising: 

receiving a markup language page from a web server 

having a field for input data; 
receiving input from a user related to the field; and 
sending data indicative of the input and an indication 

of a grammar for recognition. 

12, The computer readable medium of claim 11 wherein the 
indication provides a reference to locations of the grammars, 

13. The computer readable medium of claim 12 wherein the 
indication includes a reference to a language for speech 
recognition. 
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14. A method for recognition in a client /server network, the 
method comprising: 

receiving a markup language page having a field for 
input data on a client device connected to a 
network from a web server at an address on the 
network different than the client device; 

receiving input from a user related to the field on the 
client device; and 

sendinq data indicative of the input and an indication 
of a grammar for recognition to a recognition 
server at an address on the network different than 
the client, device. 

15. The method of claim 14 and further comprising: 
normalizing the data indicative of the input prior to 

sending the data . 

16. The method of claim 41 and further comprising; 
receiving a recognition result pertaining to recognized 

speech and associating the result with a data field 
on the client device, 

17. A computer readable medium having a markup language for 
execution on a client device in a client/server system, the 
markup language comprising an instruction indicating an 
object model element having an attribute or a method for 
recognition . 

18. The computer readable medium of claim 17 wherein the 
element is for one of speech recognition, handwriting 
recognition, gesture recognition, DTMF recognition and visual 
recognition. 
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19* The computer readable medium of claim 17 wherein the 
element comprises an indication of a grammar for recognition. 

20* The computer readable medium of claim 19 wherein element 
pertains to receiving and processing a text string like 
recognition. 

21. The computer readable inedium of claim 20 and further 
comprising a second object jaodel receiving the text string 
front another component, 

22. A computer readable medium having a markup language for 
execution on a client device in a client/server system, the 
markup language comprising an instruction indicating an 
object model element having an attribute or a method for 
audibly prompting. 

23. The computer readable medium of claim 22 wherein the 
markup language comprises one of HTML, XHTML, cHTML, XML and 
WML. 

24. The computer readable medium of claim 22 wherein the 
markup language comprises a scripting language. 

25. The computer readable medium of claim 22 wherein the 
markup language comprises a synchronized markup language. 
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3. Detailed Description of the Invention 
FIELD OF THE INVENTION 

The present invention relates to access of information 
over a wide area network such as the Internet. More 
particularly, the present invention relates to web enabled 
recognition allowing information and control on a client side 
to be entered using a variety of methods. 

BACKGROUND OF TUB INVENTION 

Small computing devices such as personal information 
managers <PEM). devices and portable phones are used with 
ever increasing frequency by people in their day-to-day 
activities. With the increase in processing power now 
available for microprocessors used to run these devices., the 
functionality of these devices are Increasing, and in some 
cases, merging. For instance, many portable phones now can be 
used to access and browse the Internet as well as can be used 
to store personal information such as addresses, phone 

numbers and the like. 

in view thai, these computing devices are being used tor 
browsing the Internet, or are used in other server/client 
architectures, it i« therefore necessary to enter information 
into the. computing device. On fortunately , due to the desire 
to keep these devices as small' as possible in order that they 
are easily carried, conventional keyboards having all the 
letters of the alphabet as isolated buttons are usua: ly not 
possible due to the United surface area available on the 
housings cf the computing devices. 

Recently, voice portals such as through the use of 
Vo.iceXML (voice extensible markup language) have been 
advanced to allow Internet content to be accessed using only 
a telephone. In this architecture, a document server (for 



(57) #12 0 0 3- 1 3 1 7 7 2 

example, a web server) processes requests frons a client 
through a VoiceXML interpreter. The web server can produce 
VoiceXML documents in reply r which are processed by the 
VoiceXML interpreter and rendered audibly to the user, Using 
voice commands through voice recognition, the user can 
navigate the web. 

VoiceXML is a markup language with flow control tags; 
however, flaw control does not follow the HTML {Hyper Text 
Markup Language) flow control model, which includes eventing 
and separate scripts. Rather, VoiceXML generally includes a 
form interpretation algorithm that is particularly suited for 
telephone-based voice-only interaction, and commonly, where 
the information obtained from the user is under the control 
of the system or application. Incorporation of VoiceXML 
directly into applications available in a client-server 
relationship where graphically user interfaces are also 
provided will require the developer to master two forms of 
web authoring, one for VoiceXML and the other usi ng HTML {or 
the like), each one following a different flow control model. 

There is thus an ongoing need to improve upon the 
architecture and methods used to provide speech recognition 
in a server/client architecture such as the Internet. The 
authoring tool for speech recognition should be easily 
adaptable to small computing devices such as PIMs, telephones 
and the like. An architecture or method of web authoring that 
addresses one, several or all of the foregoing disadvantages 
is particularly needed* 
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SUMMARY OP THIS INVENTION 



A markup language for execution on a client device in a 
client/server system includes an instruction indicating a 
grammar to associate with input data entered through the 
client device. 

With the availability of this extension and as another 
aspect of the present invention, a client device can execute 
instructions to receive a markup language page from a web 
server having a field for input data. The client device can 
then receive input data from a user related to the field and 
send the data and an indication or" the grammar for 
recognition to a recognition server, typically, located at a 
remote location for processinq. 

The recognition server can execute instructions to 
receive the input data and the indication of the grammar to 
perform recognition. The results of recognition can then be 
sent back to the client device or web server for further 
processing . 
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DETAILED DESCRIPTION OF THE INVENTION 



Before describing an architecture of web based 
recognition and methods for implementing the same, it may be 
useful to describe generally computing devices that can 
function in the architecture. Referring now to FIG. 1, an 
exemplary form of a data management device (PIM, PDA or the 
like) is illustrated at 30. However, it is contemplated that 
the present invention can also be practiced using other 
computing devices discussed below, and in particular, those 
computing devices having limited surface areas for input 
buttons or the like. For example, phones and/or data 
management devices will also benefit from the present 
invention. Such devices will have an enhanced utility 
compared to existing portable personal information management 
devices and other portable electronic devices, and the 
functions and compact size of such devices will more likely 
encourage the user to carry the device at aU times. 
Accordingly, it is not intended that the scope of the 
architecture herein described be limited by the disclosure of 
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an exemplary da La management ox device, phone or computer 

herein illustrated. 

An exemplary form of a data management, mobile device 30 
is illustrated in FIG . 1. The mobile device 30 includes a 
housing 32 and has an user interface including a display 34, 
which uses a contact sensitive display screen in conjunction 
with a stylus 33. The stylus 33 is used to press or contact 
the display 34 at designated coordinates to 3elect a field, 
to selectively move a starting position of a cursor f or to 
otherwise provide command information such as through 
gestures or handwriting. Alternatively, or in addition, one 
or more buttons 35 can be included on the device 30 for 
navigation. In addition, other input u.echanisnis such as 
rotatable wheels, rollers of the like can also be provided. 
However, it should be noted that the invention is not 
intended to be limited by these forms of input mechanisms. 
For instance, another form of input can include a visual, 
input such as through computer vision. 

Referring now to FIG, 2, a block diagram illustrates the 
functional components comprising the mobile device 30* A 
central processing unit (CPU J 50 implements the software 
control functions. CPU 50 is coupled to display 34 so that 
text and graphic icons generated in accordance with the 
controlling software appear on the display 34. A speaker 43 
can be coupled to CPU 50 typically with a digi tal-to-analoc 
converter 59 to provide an audible output. Data that is 
downloaded or entered by the user into the mobile device 30 
is stored in a non-volatile read/write random access memory 
store 54 bi-cirectiona lly coupled to the CPU SO. Random 
access memory (RAN) 54 provides volatile storage for 
instructions that are executed by CPU 50, and storage for 
temporary data, such as register values. Default values for 
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configuration options and other variables are stored in a 
read only memory (RON) 58. ROM 58 can also be used to store 
the operating system software for the device that controls 
the. basic functionality of the mobile 30 and other operating 
system kernel functions (e.g., the loading of software 
components into RAM 54), 

RAM 54 also serves as a storage for the code in the 
manner analogous to the function of a hard drive on a PC that 
is used to store application programs. It should be noted 
that although non-volatile memory iy used for storing the 
code, it alternatively can be stored in volatile memory thar 
is not used for execution of the code. 

Wireless signals can be transmitted/received by the 
mobile device through a wireless transceiver 52, which is 
coupled to CPU 50. An optional communication interface 60 can 
also be provided for downloading data directly from a 
computer (e.g., desktop computer), or from a wired network, 
/,f desired. Accordingly, interface 60 can comprise various 
forms of communication devices , for enamplc, an infrared 
link, modem, a network card, or the like. 

Mobile device 30 includes a microphone 29, and analog- 
to-digital (A/D) converter 37, and an optional recognition 
program {speech, DTMF, handwriting, gesture or computer 
vision) stored in store 54. By way of example, in response to 
audible information, instructions or commands from a user of 
device 33, microphone 29 provides speech signals, which are 
digitized by A/D converter 37/ The speech recognition program 
can perform normalization and/or feature extraction functions 
on uhe digitized speech signals to obtain intermediate speech 
recognition results. Using wireless transceiver 52 or 
communication interface 60 f speech data is transmitted to a 
remote recognition server 20 4 discussed below and illustrated 
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in the architecture of FIG. 5. Recognition results are then 
returned to mobile device 30 for rendering (e.g. visual 
aad/or audible) thereon, and eventual transmission to a web 
server 202 (FIG. 5), wherein the web server 202 and mobile 
device 3C operate in a client/server relationship- Similar 
processing can be used for other forms of input. For example, 
handwriting input can be digitized with or without pre- 
processing on device 30. Like the speech data, this form of 
input can be transmitted to the recognition server 204 for 
recognition wherein the recognition results are returned to 
at least one of the device 3D and/or web server 202. 
Likewise, DTMF data, gesture data and visual data can be 
processed similarly. Depending on the form of input, device 
30 (and the other forms of clients discussed below) would 
include necessary hardware such as a camera for visual input. 

FIG. 3 is a plan view of an exemplary embodiment of a 
portable phone 80. The phone 80 includes a display 82 and a 
keypad 84, Generally, the block diagram of FIG. 2 applies zo 
the phone of FIG, 3, although additional circuitry necessary 
to perform other functions may be required, For instance, a 
transceiver necessary to operate as a phone will be required 
for the embodiment of FIG. 2; however, such circuitry is not 
pertinent to the present invention. 

In addition to the portable or mobile computing devices 
described above,. it should also be understood that the 
present invention can be used with numerous other computing 
devices such as a general desktop computer. For instance, the 
present invention will allow a user with limited physical 
abilities to input or enter text into a computer or other 
computing device when other conventional input devices, such 
as a full alpha-numeric keyboard, are too difficult to 
operate . 
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The invention is also operational with numerous other 
general purpose or special purpose computing systems, 
environments or configurations. Examples of well known 
computing systems, environments, and/or configurations that 
may be suitable for use with the invention include, but are 
not limited to, regular telephones (without any screen) 
personal computers, server computers, hand-held or laptop 
devices , multiprocessor systems, microprocessor-based 
systems, set top boxes, programmable consumer electronics, 
network PCs, minicomputers, mainframe computers, distributed 
computing environments that include any of the above systems 
or devices, and the like. 

The following is a brief description of a general 
purpose computer 12C illustrated in FIG. 4. However, the 
computer 120 is again only one example or a suitable 
computing environment and is not intended to suggest any 
limitation as to the scope of use or functionality of the 
invention. Neither should the computer 120 be interpreted as 
having any dependency or requirement relating to any one or 
combination of components illustrated therein. 

The invention may be described in the general 
context of computer-executable instructions, such as 
program module*,, being executed by a computer. Generally, 
program modules include routines, programs, objects, 
components, data structures, etc. that perform particular 
tasks . or implement particular abstract data types. The 
invention may also be practiced in distributed computing 
environments where tasks are performed by remote processing 
devices that are linked through a communications network. 
In a distributed computing environment, program modules may 
be located in both local and remote computer storage media 
including memory storage devices. Tasks performed by the 
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programs and modules are described below and with the aid 
of figures. Those skilled in the art can implement the 
description and figures as processor executable 
instructions, which can be written on any form of a 
computer readable medium. 

With reference to FIG * 4, components of computer 
120 may include, but are not limited to, a processing unit 
140, a system memory ISO , and a system bus 141 that couples 
various system components including the system memory to 
the processing unit 140. The system bus 141 may be any of 
several types of bus structures including a memory bus or 
memory controller, a peripheral bus r and a local bus using 
any of a variety of bus architectures. By way of example, 
and not limitation, such architectures include Industry 
Standard Architecture [ISA) bus, Universal Serial Bus 
(U3B) , Micro Channel Architecture (MCA) bus, Enhanced rSA 
(EISA) bus, Video Electronics Standards Association (VESA) 
local bus, and Peripheral Component Interconnect {PCI) bus 
also known as Mezzanine bus, computer 120 typically 
includes a variety of computer readable mediums. Computer 
readable mediums can be any available media that can be 
accessed by computer 120 and includes both volatile and 
nonvolatile media, removable and non-removable media. By 
way of example, ar.d not limitation, computer readable 
mediums may comprise computer storage media and 
communication media* Computer storage media includes both 
volatile and nonvolatile, removable and non-removable media 
implemented in any method or technology for storage of 
information such as computer readable instructions , data 
structures, program modules or other data. Computer storage 
media includes, but is not limited to, RAM, ROM, EEPROM, 
flash memory or other memory technology, CD-ROM, digital 
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versatile disks (DVD) cr other optical disk storage, 
magnetic cassettes, magnetic tape, magnetic disk storage or 
other magnetic storage devices, or any other medium which 
can be used to store the desired information and which can 
be accessed by computer 12 0. 

Communication media typically embodies computer 
readable instructions, data structures, program modules or 
other data in a modulated data signal such as a carrier 
wave or other transport mechanism and includes any 
information delivery media . The term ^modulated data 
signal" means a signal that has one or more of its 
characteristics set or changed in such a manner as to 
encode information in the signal. By way of example, and 
not limitation, communication media includes wired media 
such as a wired network or direct-wired connection, and 
wireless media such as acoustic, FR, infrared and other 
wireless media. Combinations of any of the above should 
also be included within the scope of computer readable 
media . 

The system memory 150 includes computer storage 
media in the form of volatile and/or nonvolatile memory 
such as read only memory {ROM) 151 and random access memory 
(RAH) 152. A basic input/output system 153 (BIOS), 

containing the basic routines that help to transfer 
information between elements within computer 120, such as 
during start-up, is typically stored in ROM 151. RAN 152 
typically contains data and/or program modules that are 
immediately accessible to and/or presently being operated 
on by processing unit 140. By way of example, and not 
limitation, FIG. 4 illustrates operating system 54, 
application programs 155, other program modules 156, and 
program data 157. 
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The computer 120 may also include other 
removable /non-removable volatile /nonvolatile computer 
storage media. By way of example only, FIG. 4 illustrates 
a hard disk drive 161 that reads from or writes to nan- 
removable, nonvolatile magnetic media, a magnetic disk 
drive 171 that reads from or writes to a removable, 
nonvolatile magnetic disk 172, and an optical disk drive 
17 5 that reads from or writes to a removable, nonvolatile 
optical disk 176 such as a CD ROM or other optical media. 
Other removable/non-removabie, volatile/nonvolatile 

computer storage media that can be used in the exemplary 
operating environment include, but are not limited to, 
magnetic tape cassettes, flash memory cards, digital 
versatile disks, digital video tape, solid state RAM, solid 
state ROM, and the like. The hard disk drive 161 is 
typically connected to the system bus 141 through a non- 
removable memory interface such as interface 160, and 
magnetic disk drive 171 and optical disk drive 175 are* 
typically connected to the system bus 141 by a removable 
memory interface, such as interface 170. 

The drives and their associated computer storage 
media discussed above and illustrated in FIG. 4, provide 
storage of computer readable instructions, data structures , 
program modules and other data for the computer 120. In 
FIG. 4, for example, hard disk drive 161 is illustrated as 
storing operating system 164, application programs 165 , 
other program modules 166, and program data 167. ftote that 
these components can either be the same as or different 
from operating system 154, application programs 155, other 
program modules 156, and program data 157. Operating system 
164, application programs 165, other program modules 166, 
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and program data 167 are given different numbers here to 
illustrate that, at a minimum, they are different copies, 

A user may enter commands and information into 
the computer 120 through input devices such as a keyboard 
182, a microphone 183, and a pointing device 181 f such as a 
mouse, trackball or touch pad. Other input devices (not 
shown) may include a joystick, game pad, satellite dish, 
scanner, or the like. These and other input devices are 
often connected tc the processing unit 140 through a user 
input interface 180 that is coupled to the system bus, but 
may be connected by other interface and bus structures, 
such as a parallel port, game port or a universal serial 
bus {USB). A monitor 184 or other type of display device 
is also connected to the system bus 141 via an interface, 
such as a video interface 185. In addition to the monitor, 
computers may also include other peripheral output devices 
such as speakers 187 and printer 186, which may be 
connected through an output peripheral interface 18 8. 

The computer 120 may operate in a networked 
environment using logical connections to one or more remote 
computers, such as a remote computer 134. The remote 
computer 194 may be a personal computer, a hand-held 
device, a server, a router, a network PC, a peer device or 
other common network node, and typically includes many or 
all of the elements described above relative to the 
computer 120. The logical connections depicted in Fiq. 4 
include a local area network (LAN!) 191 and a wide, area 
network (WAN) 193, but may also include other networks. 
Such networking environments are commonplace in offices, 
enterprise-wide computer networks, intranets and the 
Internet . 
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When used in a LAN ■ networking environment, the 
computer 120 is connected to the LAN 191 through a network 
interface or adapter 190. When used in a WAN networking 
environment, the computer 120 typically includes a modem 
192 or other means for establishing communications over the 
WAKi 193, such as the Internet. The modem 192, which may be 
internal or external, may be connected to the system bus 
141 via the user input interface 180, or other appropriate 
mechanism. In a networked environment, program modules 
depicted relative to the computer 120, or portions thereof, 
may be stored in the remote memory storage device. By way 
of example, and not limitation, FIG. 4 illustrates remote 
application programs 195 as residing on remote computer 
194. It will be appreciated that the network connections 
shown are exemplary and other means of establishing a 
communications link between the computers may be used. 

FIG. 5 illustrates architecture 200 for web h<af>ei<± 
recognition as can be embodied in the present invention. 
Generally, information stored in a web server 202 can be 
accessed through mobile device 30 (which herein also 
represents other forms of computing devices having a 
display screen r a microphone, a camera, a touch sensitive 
panel, etc. r as .required based on the form of input) r or 
through phone 8 0 wherein information is requested audibly 
or through tones generated by phone 80 in response to keys 
depressed and wherein information from web server 202 is 
provided only audibly back to the user. 

More importantly though, architecture 200 is 
unified in chat whether information is obtained through 
device 30 or phone 8 0 using speech recognition, 3 single 
recognition server 204 can support eithGr mode of 
operation. In addition, architecture 200 operates using an 



(69) %fM 2003-131772 

extension of well-known jnark-up languages (e*g. HTML, 
XHTML, cHTML, XML, WML, and the like). Thus, information 
stored on web server 2 02 can also be accessed using well- 
known GUI methods found in these mark-up languages. By 
using an extension of well-known mark-up languages, 
authoring on the web server 202 is easier, and legacy 
applications currently existing can be also easily modified 
to include voice recognition- 

Generally, device 30 executes HTML pages, 
scripts, or the like, provided by web server 202. when 
voice recognition is required, by way cf example, speech 
data, which can be digitised audio signals or speech 
features wherein the audio signals have been preprocessed 
by device 30 as discussed above, are provided to 
recognition server 203 with an indication of a grammar or 
language model to use during speech recognition. The 
implementation of the recognition server 204 can take many 
forms, one of which is illustrated, but generally includes 
a recognizer 211. The results of recognition are provided 
back to device 30 for local rendering if desired or 
appropriate. Upon compilation of information through 
recognition and any graphical user interface if used, 
device 30 sends the information to web server 202 for 
further processing and receipt of further HTML 
pages/scripts, if necessary. 

As illustrated in FIG. 5, device 30, web server 
202 and recognition server 204 are commonly connected, and 
separately addressable, through a network 205, herein a 
wide area network such as the Interne-, It therefore is not 
necessary that any of these devices be physically located 
adjacent each other. In particular, it is not necessary 
that web server 202 includes recognition server 204. In 
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this manner, authoring at web server 202 can be focused on 
the application to which it is intended without the authors 
needing to know the intricacies of recognition server 204. 
father, recognition server 204 can be independently 
designed and connected to the network 205, and thereby, be 
updated and improved without further changes required at 
web server 202. As discussed below, web server 202 can also 
include an authoring mechanism that can dynamically 
generate client-side markups and scripts. In a further 
embodiment, the web server 202, recognition server 204 and 
client 3 0 may be combined depending on zhe capabilities of 
the implementing machines- For instance, if the client 
comprises a general purpose computer, e.g. a personal 
computer, the client may include the recognition server 
204. Likewise, if desired, the web server 202 and 
recognition server 204 can be incorporated into a single 
machine* 

z\n aspect oz the present invention is a method 
for processing input data in a client /server system that 
includes receiving from a server a markup language page 
having extensions configured to obtain input data from a 
user of a client device; executing the markup language page 
.on the client device; transmitting input data (indicative 
of speech, DTMF, handwriting, gestures or images obtained 
from the user) and an associated grammar to a recognition 
server remote from the client; and receiving a recognition 
result from the recognition server at the client, Another 
aspect is a computer readable medium having a markup 
language for execution on a client device in a 
client/server system, the markup language having an 
instruction indicating a grammar to associate with input 
data entered through the client device. 
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Access to web server 202 through phone 80 
includes connection of phone 80 to a wired or wireless 
telephone network 208, that in turn, connects phone 80 to a 
third party gateway 210. Gateway 210 connects phone 80 to a 
-elephony voice browser 212, Telephony voice browser 212 
includes a media server 214 that provides a telephony 
interface and a voice browser 216. Like device 30, 
relephony voice browser 212 receives HTML pages/scripts or 
zhe like from web server 202. More importantly though, the 
HTML pages/scripts are of the form similar to H?ML 
pages/scripts provided to device 30. In this manner, web 
server 202 need not support device 30 and phone 8Q 
separately, or even support standard GUI clients 
separately. Rather, a common mark-up language can be used. 
In addition, like device 30, voice recognition from audible 
signals transmitted by phone 80 are provided from voice 
browser 216 to recognition server 204, either through the 
network 2 05, or through a dedicated line 207, for example, 
using TCP/ IP. Web server 202, recognition server 204 and 
telephone voice browser 212 can be embodied in any suitable 
computing environment such as the general purpose desktop 
computer illustrated in FIG. 4. 

However, it should be noted that if DTMF 
recognition is employed, this form of recognition would 
generally be performed at the media server 214, rather than 
at the recognition server 204 . In other words, the DTMF 
grammar would be used by the media server. 

As indicated above, one aspect of the present 
invention includes extension of mark-up languages such as 
HTML , XHTML cHTML, XML, WML or with any other SGML-derived 
markup to include controls and/or objects that provide 
recognition in a client /server architecture. In this 
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manner, authors carx leverage all the tools and expertise in 
these mark-up languages that are the predominant web 
development platform used in such architectures. 

Generally, controls and/or objects can include 
one or more of the following functions: recognizer controls 
and/or objects for recognizer configuration, recognizer 
execution and/or post-processing; synthesizer controls 
and/or objects for synthesizer configuration and prompt 
playing; grammar controls and/or objects for specifying 
input grammar resources; and/or binding controls and/or 
objects for processing recognition results. The eKtensione 
are designed to be a lightweight markup layer, which adds 
the power of an audible, visual, handwriting, etc. 
interface to existing markup languages. As such, the 
extensions can remain independent of: the high-level page 
in which they are contained, e.g. HTML; the low-level 
formats which the extensioas used to refer to linguistic 
resources, e.g. the text-to-speech and grammar formats; and 
the individual properties of the recognition and speech 
synthesis platforms used in the recognition server 204. 

Before describing mark-up languages having 
controls and/or objects suited for recognition, it may be 
helpful to examine a simple GUI example herein embodied^ 
with the HTML mark-up language. Referring to FIG. 6, a 
simple GUI interface comprises submission of credit card 
information, to the web server tc complete an on-line sale. 
In this example, the credit card information includes a 
field 250 for entry of the type of credit card being used, 
for example, Visa r MasterCard or American Express. A second 
field 252 allows entry of the credit card number, while a 
third field 254 allows entry of the expiration date. Submit 
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button 264 is proviced to transmit the information entered 
in fields 250, 252 and 254. 

FIG. 7 illustrates the HTML code for obtaining 
the foregoing credit card information from the client. 
Generally, as is coirraon in these forms of nark-up 
languaqes, the code includes a body portion 260 and a 
script portion 2 62. The body portion 260 includes lines of 
code indicating the type of action to be performed, the 
form to use, the various fields of information 250, 2 52 and 
254, as well as a code for submir button 264 (FIG. 6). This 
example also illustrates eventing support and embedded 
script hosting, wherein upon activation of the submit 
button 264, a function "verify" is called or executed in 
script portion 262. The -verify" function ascertains 
whether the card number length for each of the credit cards 
(Visa, MasterCard and American Express) is of the proper 
length. 

FIG. 3 illustrates a client markup that generates 
the same GUI of FIG . 6 for obtaininq credit card 
information to be provided to web server 204 using speech 
recognition. Although speech recognition will be discussed 
below with respect to FIGS . 8-14, it should be understood 
that the techniques described can be similarly applied in 
handwriting recognition, gesture recognition and image 
recognition . 

Generally, the extensions (also commonly known as 
"tags") are a small set of XML elements, wirh associated 
attributes and DOM object properties, events and methods, 
which may be used in conjunction with a source markup 
document to apply a recognition interface, DTMF or call 
control to a source page. The extensions formalities and 
semantics are independent of the nature of the source 
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document, so the extensions can be used equally effectively 
within HTML, XHTML, cHTML, XMT„ WML, or with any ozher 
SGML-derived markup. The extension follow the document 
object model wherein new functional objects or elements, 
which can be hierarchical, are provided- Each of the 
elements are discussed in derail in the Appendix, but 
generally the elements can include attributes, properties, 
methods, events and/or other "child" elements - 

At this point, it should also be noted that the 
extensions may be interpreted in two different * modes" 
according to the capabilities of the device upon which the 
browser is being executed on. In a first mode, * object 
mode*, the full capabilities are available. The 
programmatic manipulation of the extensions by an 
application is performed by whatever mechanisms are enabled 
by the browser on the device, e.g. a JScript interpreter in 
an XHTML browser, or a WMLScript interpreter in a WML 
browser. For this reason, only a small set of core 
properties and methods of the extensions need to be 
defined, and these manipulated by whatever programmatic 
mechanisms exist on the device or client side. The object 
mode provides eventing and scripting and can offer greater 
functionality to give the dialog author, a much finer 
client-Side control over speech interactions. As used 
herein, a browser that supports full event and scripting is 
called an "uplevel browser . This form of a browser will 
support all the attributes, properties, methods and events 
of the extensions. Upievel browsers are commonly found on 
devices with greater processing capabilities. 

The extensions can also be supported in a 
^declarative mode" . As used herein, a browser operating in 
a declarative mode is called a "downlevel browser'" and does 
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not support full eventing and scripting capabilities. 
Rather, this form of browser will support the declarative 
aspects of a given extension (i.e. the coro element and 
attributes )~, but possibly not all the DOM (document object 
model) object properties, methods and events. This mode 
employs exclusively declarative synLax, and may further be 
used in conjunction with declarative multimedia 
synchronization and coordination mechanisms (synchronized 
markup language) such as SMIL (Synchronized Multimedia 
Integration Language) 2.0. Downlevel browsers will 
typically be found on devices with limited processing 

capabilities , 

At this point though, a particular mode of entry 
should be discussed. In particular, use of speech 
recognition in conjunction with at least a display and, in 
a further embodiment, a pointing device as well tc indicate 
the fields for data entry is particularly useful. 
Specifically, in this mode of data entry, the user is 
generally under control of when to select a field and 
provide corresponding information. For instance, in the 
example of FIG. 6, a user could first decide to enter the 
credit card number in field 2S2 and then enter the type of 
.credit card in field 250 followed by the expiration date in 
field 254. Likewise, the user could return back to field 
252 and correct an errant entry, if desired. When combined 
with speech recognition as described below, an easy and 
natural form of navigation is provided. As used herein, 
this form ot entry using both a screen display allowing 
free form selection of fields and voice recognition is 
called ^multi-modal" . 

Referring back to FIG. 8, an example of HTML 
mark-up Language code is illustrated. Lite the HTKL code 
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illustrated in FIG* 7, this code also includes a body 
portion 270 and a script portion 272. Also like the code 
illustrated in FIG. 7, the code illustrated in FIG- 8 
includes indications as to the type of action to perform as 
well as the location of the form* Entry of information in 
each of the fields 250, 252 and 254 is controlled or 
executed by code portions 280, 282 and 284, respectively- 
Referring first to code portion 280, on selection of field 
250, for example, by use of stylus 33 of device 30, the 
event "onclick" is initiated which calls or executes 
function "taik" in script portion 272- This action 
activates a grammar used for speech recognition that is 
associated with the type of data generally expected in 
field 2 50. This type of interaction, which involves more 
than one technique of input (e.g. voice and pen- 
click/roller) is referred as ^multimodal". 

It should be noted that the speech recognition 
extensions exemplified in Fig. 8 are not intended to have a 
default visual representation on the browser of the client, 
since for many applications it is assumed that the author 
will signal the speech enablement of the various components 
of the page by using applicat ion-specification graphical 
mechanisms in the source page. Nevertheless, if visual 
representations are desired, the extensions can so be 
modified. 

Referring now back to the grammar, the grammar is 
a syntactic grammar such as but not limited to a context- 
free grammar, a N-grammar or a hybrid grammar. (Of course, 
DTMF grammars, handwriting grammars, gesture grammars and 
image grammars would be used when corresponding forms of 
recognition are employed. As used herein, a "grammar" 
includes information for performing recognition, and in a 
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further embodiment , information corresponding tc expected 
input to be entered, for example, in a specific field) A 
new control 290 (herein identified as "reco"), comprising a 
first extension of the mark-up language, includes various 
elements, two of which are illustrated, namely a grammar 
elenent "grammar" and a "bind" element. Generally, like the 
code downloaded to a client from web server 202 , the 
grammars can originate at web server 202 and be downloaded 
to the client and/or forwarded to a remote server for 
speech processing. The grainmars can then be stored locally 
thereon in a cache. Eventually, the grammars are provided 
to the recognition server 20 4 for use in recognition. The 
grammar element is used to specify grammars, either inline 
or referenced using an attribute. 

□pon receipt of recognition results from 
recognition server 204 corresponding to the recognized 
speech, handwriting, gesture, image, etc., syntax of reco 
control 290 is provided to receive the corresponding 
results and associate it with the corresponding field, 
which can include rendering of the text therein on display 
34* In the illustrated embodiment, upon completion of 
speech recognition with the result sent back to the client, 
it deactivates the reco object and associates the 
recognized text with the corresponding field. Portions 282 
and 284 operate similarly wherein unique reco objects and 
grammars are called for each of the fields 252 and 254 and 
upon receipt of the recognized text is associated with each 
of the fields 252 and 254. With respect to receipt of the 
card number field 252, the function "handle" checks the 
length of the card number with respect to the card type in 
a manner similar to that described above with respect to 
FIG . 7. 
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Generally/ use of speech recognition in 
conjunction with architecture 200 and the client side mark- 
up language occurs as fellows: first, the field that is 
associated with the speech to be given is indicated. In the 
illustrated embodiment , the stylus 33 is used; however, it 
should be understood that the present invention is not 
limited to use of the stylus 33 wherein any form of 
indication can be used such as buttons , a mouse pointer, 
rotatable wheels or the like. Corresponding event such as 
"onClick" can be provided as is well known with use of 
visual mark-up languages. It should be understood that the 
present invention is not limited to the use of the 
^onClick" event to indicate the start of voice, 
handwriting, gesture, etc commands. Any available GUI event 
can be used for the same purpose as well, such as 
tt onSelect ; . In one embodiment, such eventing is 
particularly useful for it serves to indicate both the 
beginning and/or end of the corresponding speech. It should 
also be noted that the field for which the speech is 
directed at can be indicated by the user as well as 
programs running on the browser that keep track of user 
interactions . 

At this point, it should be stated that different 
scenarios of speech recognition require different behaviors 
and/or outputs from recognition server 204. Although the 
starting of the recognition process is standard in al_ 
cases - an explicit start {> call from uplevel browsers, or 
a declarative <reco> element in downlevel browsers - the 
means for stopping speech recognition may differ. 

In the example above, a user in a multimodal 
application will control input into the device by, for 
example, tapping and holding on a pressure sensitive 
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display- The browser then uses a GUI event, e.g. "pen-up" , 
to control when recognition should stop and then returns 
the corresponding results. However, in a voice-only 
scenario such as in a telephone application (discussed 
below) or in a hands-free application, the user has no 
direct control over *rhe browser r and the recognition server 
204 or the client 30 , must take the responsibility of 
deciding when to stop recognition and return the results 
(typically once a path through the grainmar has been 
recognized). Further, dictation and other scenarios where 
intermediate results need to be returned before recognition 
is stopped (also known as "open microphone" ) not only 
requires an explicit stop function, but also needs to 
return multiple recognition results to the client 30 and/or 
"web server 202 before the recognition process is stopped. 

In one embodiment , the Reco element can include a 
"mode" attribute to distinguish the following three modes 
of recognition, which instruct the recognition server 204 
how and when to return results. The return of results 
implies providing the ^onReco" event or activating the 
"bind" elements as appropriate. In one embodiment , if the 
mode is unspecified, the default recognition mode can be 
"automatic". 

h'lGr 12 is a pictorial representation of 
operation of the * automatic" mode for speech recognition 
(similar modes, events, etc. can be provided for other 
forms of recognition). A timeline 281 indicates when the 
recognition server 204 is directed to begin recognition at 
283 r and where the recognition server 204 detects speech at 
285 and determines that speech has ended at 287. 

Various attributes of the Reco element control 
behavior of the recognition server 204. The attribute 
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%x initialTimeout' f 289 is the time between the start of 
recognition 283 and the detection of speech 2 85. If this 
time period is exceeded , "onSilence" event 2 92 will be 
provided from the recognition server 204, signaling that 
recognition has stopped. If the recognition server 204 
finds the utterance to be unrecognizable, an "onKoReco" 
event 293 will be issued, which will also indicate that 
recognition has stopped. 

Other attributes that can stop or cancel 
recognition include a " babbie Time out" attribute 295 , which 
is the period of time in which the recognition server 204 
must return a result after detection of speech at 285- If 
exceeded, different events are issued according to whether 
an error has occurred or not. Tf the recognition server 204 
is still processing audio, for example, in the case of an 
exceptionally long utterance, the >x onNoReco" attribute 293 
is issued. However, if the w babbleTimeout" attribute 2 95 is 
exceeded for any other reason, a recognizer error is more 
likely and an "onTimaout" event 297 is issued. Likewise, a 
"maxTiiaeout" attribute 29 9 can also be provided and is for 
the period of time between the start, of recognition 28 3 and 
the results returned to the client 30, If this time period 
is exceeded, the "onTimeout" event 297 is issued. 

~f , however, a time period greater than an 
"endSiience" attribute 301 is exceeded, implying that 
recognition is complete, the recognition server 204 
automatically stops recognition and returns its results. It 
should be noted that the recognition server 204 can 
implement a confidence measure to determine if the 
recognition results should be returned. If the confidence 
measure is below a threshold, the "cnNoReco" attribute 293 
is issued, whereas if the confidence measure is above the 
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threshold a "onNoReco" attribute 303 and the results of 
recognition are issued. FIG. 12 thereby illustrates that in 
* automatic mode" no explicit stop () calls are made. 

FIG. 13 pictorially illustrates * single mode" 
operation of the recognition server 2C4* Attributes and 
events described above with respect to the * automatic mode" 
are applicable and are so indicated with the same reference 
numbers- However , in this mode of operation, a stop (} call 
305 is indicated on timeline 281. The stop () call 305 
would correspond to an event such as * pen-up" by the user. 
In this mode of operation, the return of a recognition 
result is under the control of the explicit stop () call 
305. As with all modes of operation, the "onSilence" event 
2 91 is issued if speech is not detected within the 
M initialTimeout" period 289, but for this mode of operation • 
recognition is not stopped. Similarly, a ''onNoReco^ event 
293 generated by an unrecognizable utterance before the 
stop () call 305 does not stop recognition. However, if the 
time periods associated with the "babblelimeout" attribute 
295 or the w maxTimeout" attribute 299 are exceeded 
recognition will stop- 

FIG, 14 pictorially illustrates "multiple mode" 
operation of the recognition server 204. As indicated 
above, this mode of operation is used for an "open- 
nijc-opbone" or- i n a dictation scenario. Generally, in this 
node of operation, recognition results are returned at 
intervals until an explicit stop ()_ call 305 is received 
or the time periods associated with the s ' babbleTimeout" 
attribute 295 or the 1s maxTimeout f/ attribute 299 are 
exceeded. It should be noted, however, that after any 
^onSilence" event 2 31, ^onReco" event 30 3, or xv onNoReco" 
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event 293 r which does not stop recognition, timers for the 
* babble Timeout" and "max Timeout" periods will be reset, 

Generally, in this node of operation r for each 
phrase that is recognized, a ^onReco" event 303 is issued 
and the result is returned until the stop () call 305 is 
received. If the ^onSilence" event 291 is issued due to an 
unrecognizable utterance these events are reported but 
recognition will continue. 

As indicated above r the associated reco object or 
objects for the field Is activated, which includes 
providing at least an indication to the recognition server 
204 of which grammar to use. This information can accompany 
the speech data recorded at the client: 30 and sent to the 
recognition server 2 04- As indicated above, speech data can 
comprise streaming data associated with the speech entered 
by the user, or can include pre-processed speech data 
indicating speech features that are used during speech 
recognition. In a further embodiment r client side 
processing can also include normalization of the speech 
data such that the speech data received by the recognition 
server 204 is relatively consistent from client to client. 
This simplifies speech processing of the recognition server 
204 thereby allowing easier scalability of the recognition 
server 2 04 since Lhe recognition server can be made 
stateless with respect to the type of client and 
coramunicat ion channel. 

Upon receipt of the recognition result from the 
recognition server 204, the recognition result is 
associated with the corresponding field, and client-side 
verification or checking can be performed, if desired. Upon 
completion of all of the fields associated with the code 
currently rendered by the client, the information is sent 
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to web server 202 for application processing. From the 
foregoing, it should be clear that although the web server 
202 has provided code or pages/scripts suitable for 
recognition to ~he client 30, the recognition services are 
not performed by the web server 202, but rather by the 
recognition server 204. The invention, however, does not 
preclude an implementation where the recognition server 204 
is collocated with the web server 2 02, or the recognition 
server 204 is parz of the client 30, In other words, the 
extensions provided herein are beneficial even when the 
recognition server 204 is combined with the- web server 202 
or client 30 because the extension provide a simple and 
convenient interface between these components. 

While not shown in the embodiment illustrated in 
FIG. 8, the reco control can also include a remote audio 
object {RAO) to direct the appropriate speech data to the 
recognition server 204. The benefit for making RAO a plug- 
in object is to allow a different one for each different 
device or client because the sound interface may likely be 
different. In addition, the remote audio object can allow 
multiple reco elements to be activated at the same time. 

PIGS * 9A and 9B illustrate a voice-only mark-up 
language embodied herein as HTML with pages /scripts . As 
clearly illustrated, the code also includes a body portion 
300 and a script portion 302. there is another extension of 
the markup language - proir.pt control 303 which include 
attributes like bargein. However, speech recognition is 
conducted differently in the voice-only embodiment of FIGS. 
9A and 9B . The process is now controlled entirely by the 
script function "checkFi lied" which will determine the 
unfilled fields and activate correspondent prompt and new 
objects, Nevertheless, grammars arc activated using the 
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same context as that described above with respect to FIG, 
B r wherein speech data and the indication of the grammar to 
use are provided to the recognition server 204. Likewise/ 
the output received from the recognition server 204 is 
associated with fields of the client (herein telephony 
voi.ce browser 212) * 

Other features generally unique to voice-only 
applications is an indication to the user when speech has 
not been recognized. In multimodal applications such as Fig 
8, 'oQNoReco' simply puts mill value on the displayed field 
to indicate no-recognition, thus no further action is 
required. In the voice-only embodiment: , u onNoKeco" 305 
calls or executes a function "mumble" , which forwards a 
word phrase to recognition server 204,, that in turn, is 
converted to speech using a suitable text-to-speech system 
307 (FIG. 5) . Recognition server 204 returns an audio 
stream to the telephony voice browser 212, which in turn, 
is transmitted to phone 80 to be heard by the user. 
Likewise, other waveform prompts embodied in the voice-only 
application are also converted, when necessary, to an audio 
stream by recognition server 204. 

It should be noted that in this example after 
playing the welcome prompt via function "welcome" , function 
>s cheokFilied' r prompts zhe user for each of the fields and 
activates the appropriate grammars, including repeating the 
fields that have been entered and confirming that tho 
information is aorrea~, which includes activation of a 
"confirmation" grammar. Note in this embodiment, each of 
the reco controls ±s initiated from the scrip: portion 302, 
rather than the body portion of the previous example. 

As another aspect of the present invention, the 
markup language executable on different types of client 
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devices {e.g. multimodal and non-display, voice input based 
client devices such as a telephone) unifies at least one of 
speech-related events, GUI events and telephony ©vents for a 
web server interacting with each of the client devices. This 
is particular advantageous for it allows significant portions 
of the web server application to be written generically or 
independent of the type of client device. An example is 
illustrated in FIGS. 8 and 9A r 9B with the * handle" 
functions . 

Although not shewn in Fig 9, there are two more 
extensions to the markup language to support telephony 
functionality - DTMF (Dual Tone Modulated Frequency) 
control and call control elements or objects. DTMF works 
similarly to reco control. It specifies a simple grammar 
mapping from keypad string to text input. For example, "1" 
means grocery department, "2" mean pharmacy department, 
etc. On the other hand, call object deals with telephony 
functions, like call transfer and 3 rd party call. The 
attributes, properties, methods and events are discussed in 
detail in the Appendix - 

FIGS - 10A and 10B illustrate yet another example 
of a mark-up language suitable for a voice-only mode of 
operation. In this embodiment, the user is allowed to have 
some control over when information is entered or spoken. In 
other words, although the system may initiate or otherwise 
direct the user to begin speaking, the user may offer more 
information than what was initially asked for. This is an 
example of "mixed initiative". Generally, in this form of 
dialog interaction, the user is permitted to share the 
dialog initiative with the system. Besides the example 
indicated above and discussed below in detail where the 
user provides more information then requested by a prompt, 
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the user could also switch tasks when not prompted to do 
so* 

In the example of FIGS 10A and 10B r a grammar 
identified as ^do__£ield" includes the information 
associated with the grammars M g_card_types" , "g_cardjnum" 
and "g_expiry__date" . In this example, telephony voice 
browser 212 sends speech data received from phone 80 and an 
indication to use the "do_field" grammar to recognition 
server 204 upon receipt of the recognized speech as denoted 
by ^onReco", the function "handle" is called or executed 
thai includes associating the values for any or all of the 
fields recognized from the speech data. In other words, the 
result obtained from the recognition server 204 also 
includes indications for each of the fields. This 
information is parsed and associated with the corresponding 
fields according to binding rules specified in 405. As 
indicated in FIG. 5, the recognition server 204 can include 
a parser 309, 

From FIGS. 7, 8, 9A, 9B. 10A and 10B, a very 
similar web development framework is used. Data 
presentation is also very sinilar in each of these cases. 
In addlLion r the separation of data presentation and flow 
controls allow maximum reusability between different 
applications {system initiative and mixed-initiative), or 
different modalities {GUI veb-based, voice-only and 
multimodal) . This also allows a natural extension from 
voice-only operation through a telephone to a multimodal 
operation when phones include displays and functionalities 
similar to device 30. Appendix A provides further details 
of the controls and objects discussed above. 

As indicated above, uplevel browsers can use 
scripting in order to perform various needs such as 
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invoking the function tx handle" in the above examples to 
assign the recognition results. In the embodiments 
described above and as further described in Appendix A at 
2.1.2, the "bind" element will parse the recognition 
results and assign values wherein the rv bind" element is a 
subelement or child element of the "reco" element. 

Although scripting can be useful, it is believed 
by many not to be always the best form of browser 
implementation due, for example, to security concerns. 
Therefore, in yet another embodiment or aspect of the 
present invention, the VN bind" element is a high level 
element (similar to 

" reco" } and is provided with other richer properties, which 
can in effect mimic scripting without scripting per se. 

Without using scripting or without using the 
aspect of the presen- invention discussed below, some of 
the capabilities discussed below such as sophisticated 
dialog effects could only be accomplished by submitting a 
page back to the web server 202, executing the application 
logic thereon to generate a new page, and sending the page 
back to the client device. This aspect of the present 
invention allows a programmer to invoke methods on objects 
of the page without incurring a server roundtrip. 

In zhe embodiments described above, the "bind" 1 
element has only the attributes "TargetElement" and 
^Target At tribute" for assigning recognition results to a 
field in the form or web page. In a further embodiment, the 
u bind" element also includes a " TargetMethod" , which is 
added for object method invocation. The use and capability 
of ^TargetMethod" is the principal technique for mimicking 
scripting. For example, the following syntax can bo used to 
invoke the ' V X" method of the object *OBJl r ' : 
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<bind TargetElernent - "OBJl" TargetMethod - 

v> X" . . . > . 

Note that although the examples shown here follow 
the HTML /XHTML event syntax, those skilled at art should 
find it straightforward to generalize the <bind> usage to 
other eventing mechanisms, including but not limited to W3C 
Document Object Model Level 2 or Level 3 eventing standard, 
ECMA Common language Infrastructure (CLi) event model, Java 
programming language event model, K3C Synchronous 
Multimedia Integration Language (SMIL), and Lhe upcoming 
W3C XML Events standard proposal. 

FIGS. 15A and 15B are a page of mark-up language 
executable on a client, particularly a downlevel browser. 
In this example, the user is asked through audio prompts 
for a drink. The system then confirms ,what drink was 
ordered. Depending on the recognition results, the "bind" 
element guides the execution using declared logic, When the 
drink is confirmed, the form is submitted back to the web 
server 202 , all without scripting. 

Generally, the mark-up example of FIGS. 15A and 
15B includes a data section 350, a speech section 352 and 
user interface sections 354, 356 and 358. Section 354 
receives the recognition result from the general inquiry as 
to what drink the user would like and directs interactive 
recognition flow to either reprompt ing , inquiring as to 
whether cream or sugar is needed, or confirming the drink 
ordered. In particular, section 356 receives a recognition 
result when cream or sugar is also ordered. Section 358 
receives the recognition result for confirmation of the 
drink. Section 360 is a call control section employing a 
new messaging object *SMEX", which is discussed further 
below. 
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As indicated above, the ^bind" element of this 
aspect of the present Invention includes object method 
invocation, which will initiate user interaction in the 
example of FIGS, 15A and 15B by playing the * welcome" 
prompt when the "start" method on the ~ welcome'' object is 
executed at 361* 

The user is then asked "Do you want coke, coffee 
or orange juice?" by executing the "start" method ot the 
x * asked" object at 3 62. Recognition is then performed by 
invoking the "start" method on the recognition *reco_ 
drink" object at 363, 

The mark-up of section 354 is then executed 
wherein the grammar used by the recognition server 204 is 
provided by the Xpazh statement -./drink types". Note that 
although this example utilizes W3C . Xpath language, those 
skilled at art should find it straightforward to extend the 
concept to other standard languages, including but not 
limited to W3C XML Query Language (XQL) - As specified by 
the >N bind" element 364 r if the recognition result received 
from the recognition server 204 has a confidence score less 
than 10, the prompt, object "reprompt" is executed at 366, 
followed by the prompt object ^ask" 3 68, at wh 5 oh point, 
the recognition object v reco_drink" is reinitiated at 370. 
If the recognition result returned is "coffee" having a 
confidence greater than 10, the field * drink" is assigned 
the value of the recognition result at 372 and the user is 
then prompted as to whether or nut he/she would like cream 
or sugar by the prompt object " creaiu_sugar" at 314. The 
recognition object ^reco^cream sugar" in section 356 is 
then invoked at 376. Otherwise, if the confidence score of 
the recognition result is greater than 1C, but not coffee, 
the field drink is then again assigned at 378. Confirmation 
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of the recognition result is provided at 380 by executing 
the prompt object % confirm" , followed by invocation of the 
recognition object "reco yesno" in section 358 at 382. If 
the user answers x> yes" with a confidence score greater than 
10, the prompt object "thanks" is played at 384 and then 
the form is then submitted at 386. Otherwise, if the user 
answers ~no" or the confidence score of the recognition 
result is less than 10, the prompt object "retry" is 
executed at 3 90, one© again followed by the prompt object 
"ask" being executed at 392 , and invocation of the 
1Y reco_drink" recognition object at 394 * 

From -he foregoing example, the ^bind" element 
allows multiple invocations of methods as indicated in 
sections 354, 356 or 35B* If desired, multiple assignments 
of the recognized result can also be declared. In the 
embodiment illustrated, when multiple assignments and 
method invocations are declared, they are executed in the 
document order. 

In a further, embodiment, a convention for passing 
method arguments is also provided. In other words, some 
methods may require a list of arguments. This is achieved 
by using "arg T1 subelement. For example, given the following 
mark-up: 

<bind Targe tElement ^ Targe tHetbod 

^ F" ><arg>X</argXarg>Y</arg></bind> 
is equivalent to "OBJ. F (X, Y)" , or "OBJ" is an object that 
has a method * F" with parameters or arguments VX X" and T. 

The "bind" element can also include an w event" 
attribute, which declares the bind elemenL is intended for 
which event. For example, the mark-up: 

<bind event = "onMoReco" - TargetElsment 

"promptl" TargetMethod - " start" /> 
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means when the "on^oReco" event is seal, the method "start" 
of -ho object ™ prompt 1" will be invoked. To be consistent 
with use of the "bind" element as a child element of the 
"Reeo" element as described above for example with respect 
-o PIG. 8, the default attribute for the "bind" element is 
tt onReco" , 

The v bind" element as a high level element can 
include any of the events specified in section 2,4 of the 
Appendix. In addition, the "bind" element can also include 
a ^onError" event having a * status" attribute that can be 
accessed and used to direct program flow. To the extent 
other events of the "bind" element have * status" 
attributes, these can be accessed as well. 

In addition to checking the conditions of the 
recognition result, the current document or page being 
executed can also be checked as well. In particular, both 
the "test" and the "value" attribute can be extended to 
include a "host" primitive that refers to the root node of 
the containing document. For example, referring back to 
FIGS. ISA, 15B, the example contained "herein has 
additional logic at section 354 to ask whether the user 
v;ants cream or sugar when he/she asks for coffee. The flags 
for adding cream or sugar and thus invocation of section 
356, will be turned on only if the drink field is "coffee" 
as specified by the markup 

"host ( } /get_drink/drink=' coffee' " . 

It should also be noted, the iv bind" element is 
not only applicable to recognition results from ~he speech 
server 204 and receive or assign values in the document, 
but also to message objects (herein denoted by "smex", for 
example, from applications running on the client device. In 
the example of FIGS. 15A and 15B, the page is executed when 
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a telephony application running on the client device 
detects a call. In section 360, the M bind" element executes 
or plays the * welcome" prompt and begins recognition by 
executing the w reco_drink" object, when the message 
VCal reconnected" is received. Like the recognition results 
received from the speech server 204 , the messages received 
can vary greatly. Some of the messages are well -defined in 
order to initiate desired program flow. Others can be 
received and processed (e.g. parsed just like received 
recognition results of the recognition server. For example, 
this allows the markup to be used like a natural language 
parser of text from a keyboard* The reco element in 
Appendix A includes a property for performing this 
function. Likewise, the prompt element can be used to 
provide a textual message for dynamic content or audio wave 
files by using the property "innertext" , also further 
explained in the Appendix A. Eventing can be similar to 
eventing for recognition results. For example, eventing can 
include " onReceived" , which gels sent when the message 
source (e.g- application running on the client device) has 
a message available for the browser. 

The "smex" or message object thus allows the 
markup tags discussed herein to be extended to other 
components or applications running on the client device. As 
another example, the message object can be used to 
communicate to a TTY component for the hearing impaired 
running on the client device. Rather than using speech 
recognition, the TTY component will provide a message of 
wha~ the user has typed. This message is then used as if a 
recognition result was received from the recognition server 
wherein the message can be parsed and assigned to fields of 
the form, or other processing can take place using the 
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"reco", "grammar'* or "bind" elements described above. 
Further discussion of the message or "siriex n object is 
provided in Appendix A, 

The lx bind" element can also include a "for" 
attribute, which allows its action to be attached to other 
objects on the page. For example, a mark-up such as: 

<bind for = * prompt 1" event = "onComplete" 

targetElement - "prompt 2" - targetMethod = 

" start" />. 

will invoke the start method of the object w prompt 2" when 
the object "prompt V* sends the event y * onComplete" . 

Referring bac^ to FIG. 5, web server 202 can 
include a server side plug-in declarative authoring tool or 
module 320 (e.g. ASP or ASP+ by Microsoft Corporation, JSP, 
or the like). Server side plug-in module 320 can 
dynamically generate client-side mark-ups and even a 
specific form of mark-up for the type of client accessing 
the web server 202. The client information can be provided 
to the web server 202 upon initial establishment of the 
client/server relationship, or the web server 202 can 
include modules or routines to detect the capabilities of 
the client. In this manner, server side plug-in module 320 
can generate a client side mark-up for each of the voice 
recognition scenarios, i.e. voice only through phone 80 or 
multimodal for device 30. By using a consistent client side 
model (reco and prompt controls that can be used in each 
application) , application authoring for many different 
clients is significantly easier. 

In addition tc dynamically generating client side 
mark-ups, high-level dialog modules, like getting credit 
card information illustrated in FIG . 6 with a marJc-up 
examples of FIGS. 8, 9A and 9B, can be implemented as a 
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server-side control as stored in store 324 for use by 
developers in application authoring- In general, the high- 
level dialog modules 324 would generate dynamically client- 
side markup and script in both voice-only and multimodal 
scenarios based on parameters specified by developers. The 
high-level dialog modules can include parameters to 
generate client-side mark-ups to fit the developers' needs. 
For example, a credit card information module can include a 
parameter indicating what types of credit cards the 
client/side mark-up script should allow* A sample ASP4 page 
using in server side plug-in module 320 is illustrated in 
FIG* 11. 

Although the present invention has been described 
with reference to preferred embodiments, workers skilled in 
the art will recognize that changes may be made in form and 
detail without departing from the spirit and scope of the 
invention , 
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APPENDIX A 

1 Introduction 

The following tags are a set of markup elements that allows 
a document to use speech as eh input or output medium. The 
tags are designed to be self-contained XML that can be 
imbedded into any SGML derived markup languages such as 
HTML, XHTML, cHTML, SMIL, WML and the like. The tags used 
herein are similar to SAPI 5.0, which are known methods 
available from Microsoft Corporation of Redmond, Washington. 
The tags, elements, events, attributes, properties, return 
values, etc. are merely exemplary and should not be 
considered limiting. Although exemplified herein for speech 
and DTNF recognition, similar tags can be provided for other 
forns of recognition. 

The main elements herein discussed are: 

<prompt for speech synthesis configuration and 

prompt playing 

<reco .__> for recognizer configuration and recognition 

execution and post-processing 

<grajamar „> for specify iny Input gr aroma x resources 
<biiid „> for processing of recognition results 
<dtmf ...> for configuration and control of DTMF 

2 Re go 

The Reco element is used to specify possible user inputs and 
a means for dealing with the input results. As such, its 
main elements can be <granimar> and <bind>, and it contains 
resources for configuring recognizer properties. 
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Reco elements are activated programmaticaliy in uplevel 
browsers via Start and Stop methods r or in SMIL-enabled 
browsers by using SNIL conimands. They are considered active 
declaratively in dowrlevel browsers (i.e. non script- 
supporting browsers) by their presence on the pace. In order 
to permit che activation of multiple grammars in parallel, 
multiple Reco elements may be considered active 
simultaneously. 

Recos may also take a particular mode ~ ^automatic' r 
"single' or ^multiple/ ~ to distinguish the kind of 
recognition scenarios which they enable and the behavior of 
the recognition platform. 

2.1 JReco content 

The Reco element contains one or more grammars and 
optionally a set of bind elements which inspect the results 
of recognition and copy the relevant portions to values in 
the containing p$ge. 

In uplevel browsers, Reco supports the programmatic 
activation and deactivation of individual grammar rules.. 
Note also that all top-level rules in a grammar are active 
by default for a recognition context. 

2,1,1 <g-;raJTimar> element 

The grammar element is used to specify grammars, either 
inline or referenced using the src attribute. At least one 
grammar (either inline or referenced) is typically 
specified. Inline grammars can be text-based grammar 
formats, while referenced grammars can be text-based or 
binary type. Multiple grammar elements may be specified. If 
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more than one grammar element is specified, the rules within 
grammars are added as extra rules within the same grammar. 
Any rules with the same name will be overwritten. 



Attributes : 

• src: Optional if inline grammar is specified. URI of 
the grammar to be included. Note that all top-level 
rules in a grammar are active by default for a 
recognition context. 

o langlD: Optional. String indicating which language 

speech engine should use. The string format follows the 
xmltlang definition. For example, langID="en-us" 
denotes US English, this attribute is only effective 
when the langJD is not specified in the grammar URI- If 
unspecified, defaults to US English. 

If the langID is specified in multiple places then 
langID follows a precedence order from the lowest scope 
- remote grammar file (i.e language id is specified 
within the grammar file} followed by grammar element 
followed by reco element . 

<grammar src*"FromCity ♦ xml" /> 
or 

<grarnmar> 

<rule toplevel=*"active"> 
<p>f x om </p> 

<ruleref narae^"cit ies" /> 
</rule> 

<rule name= w c:itie5"> 
<1> 

- <p> Cambridge </p> 
<p> Seattle </p> 
<p> London </p> 
</!> 
</rule> 
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</grammar> 

If both a src-referenced grammar and an inline grammar are 
specified, the inline rules are added to the referenced 
rules, and any rules with the same name will be overwritten, 

2.1.2 <bind> element: 

The bind element is used to bind values from the recognition 
results into the page. 

The recognition results consumed by the bind element can be 
an XML document containing a semantic markup language (SML) 
for specifying recognition results. Its contents include 
semantic values, actual words spoken, and confidence scores* 
SML could also include alternate recognition choices (as in 
an N-best recognition result) . A sample SML document for the 
utterance "I'd like to travel from Seattle to Boston" is 
illustrated below ; 

<sml conf idence~"40"> 

<travel text^'X'd like to travel from Seattle to 
Boston"> 

<origin_city conf idence="4 5"> Seattle 

</origin_city> 

<des"__city conf idence^"35"> Boston 

</dest_city> 

</travel> * 
</sml> 

Since an in-uxammar recognition is assumed to produce an XML 
document ~ in semantic markup language, or SML - the values 
to be bound from the 3NL document are referenced using an 
XPath query. And since the elements in the page into which 
the ralues will be bound should bo are uniquely identified 
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(they are likely to be form controls), these target elements 
are referenced directly. 

Attributes : 

« targetElement: Required, The element to which the value 
content from the SML will be assigned {as in W3C SMIL 
2.0) . 

• tar-getAtte-ibutie: Optional. The attribute of the target 
element to which the value content from the SML will be 
assigned (as with the attributeName attribute in SMIL 
2,0) - Xf unspecified, defaults to "value". 

• test: Optional. An XML Pattern (as in the W3C XML. DOM 
specification) siring indicating the condition under 
which the recognition result will be assigned. Default 
condition is true. 

• value: Required- Am KB&TH (aa in the W3C XML DOM 
specification) string that specifies the value from the 
recognition result document to be assigned to the 
target element. 

Example: 

So given the above SML return, the following reco element 
uses bind to transfer the values in origin_city and 
dest city into the target page elements txtBoxOrigin and 
txtBoxDest : 

<input name=' 1 txtBoxOrigin" type="t:ext n /> 
cinput name^'txtF-oxDest 1 " type- "text" /> 

Oreco id="t ravel " > 

<graxnrnar src &M . /ci ty . xr&l " /> 
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<bind 



targetElement= M txtBo?iOrigin 
value ^V/origin^city" /> 



<bind 



targe t Element-" txtBoxDest 
value="//dest_city n /> 



</reco> 



This binding may be conditional, as in the following 
example, where a test is rr.ade on the confidence attribute of 
the dest_city result as a pre-condition to the bind 
operation: 

<bind targetElement^ n txtBoxDest" 
valuer " //de st__city M 

test^Vsiril/dest_city fQconf idence $gt$ 40]" 

/> 

The bind element is a simple declarative means cf processing 
recognition results on downlevel or uplevel browsers. For 
more complex processing, the reco DOM object supported by 
uplevel browsers implements the onReco event handler to 
permit programmatic script analysis and post -processing of 
the recognition return. 

2 2 Attributes and prop&xrtl&s 

The following attributes are supported by all browsers, and 
the properties by uplevel browsers. 

2.2.1 Attributes 

The following attributes of Reco are used to configure the 
speech recognizer for a dialog turn. 
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• initial^lmeotit: Optional. The time in milliseconds 
between start of recognition and the detection of 
speech. This value is passed to the recognition 
platform, and if exceeded, an onSilence event will 
be provided from the recognition platform (see 
2.4,2). If not specified, the speech platform will 
use a default value - 

• baKbleTimeout : Optional. The period of time in 
milliseconds in which the recognizer must return a 
result after detection of speech. For recos in 
automatic and single mode, this applies to the 
period between speech detection and the stop call. 
For recos in 'multiple' mode, this timeout applies 
to the period between speech detection and each 
recognition return - i.e. the period is restarted 
afLer each return of results or other event. If 
exceeded, different events are thrown according to 
whether an error has occurred or .not. If the 
recognizer is still processing audio - eg in the 
case of an exceptionally long utterance - the 
onNoReco event is thrown, with status code 13 (see 
2.4.4). If the timeout is exceeded for any other 
reason, however, a recognizer error is more likely, 
and the onTimeout event is thrown. If not specified, 
the speech platform will default to an internal 
value . 

• maxTinteoTit : Optional. The period of time in 
milliseconds between recognition start and results 
returned to the browser. If exceeded, the onTimeout 
event is thrown by the browser - this caters for 
network or recognizer failure in distributed 
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environments. For recos in ''multiple' mode, as with 
babbleTimeout, the period is restarted after the 
return of each recognition or other event. Note that 
the maxTimeout attribute should be greater than or 
equal to the sum of ini tialTiioeout and 
babbleTimeout. If not specified, the value will be a 
browser default. 

• endSilence: Optional. For Recos in automatic mode, 
the period of silence in milliseconds after the end 
of an utterance which must be free of speech after 
which the recognition results are returned. Ignored 
for recos of modes other than automatic* If 
unspecified, defaults to platform internal value. 

• reject: Optional. The recognition rejection 
threshold, below which the platform will throw the 
^no reco' event. If not specif red, the speech 
platform will use a default value. Confidence scores 
range between 0 and 100 (integer) . Reject values lie 
in between. 

♦ server: Optional, URI of speech platform {for use 
when the tag interpreter and recognition platform 
are not co-located) . An example value might be 
server^protocol : / /your speechplat Form* An application 
writer is also 1 able to provide speech platform 
specific settings by adding a guerystring to the URI 
string, eg 

protocol ; //your$peechplatform?bargainEn<ergyThr€>shold 

* langID: Optional. String indicating which language 
speech engine should use. The string format follows 
the xmlrlang definition. For example, langXD=''en~us" 
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denotes US English, This attribute is only effective 
when the langID is not specified in the graininar 
element {see 2.1.1). 
* mode: Optional. String specif ying the recognition 
mode to be followed. If unspecified, defaults to 
"automatic" mode. 

2-2.2 Properties 

The following properties contain the results returned by the 
recognition process (these are supported by uplevel 
browsers) . 

* recoResult Read-only. The results of recognition, held 
in an XML DOM node object containing semantic markup 
language {SML}, as described in 2,1.2, In case of no 
recognition, the property returns null. 

. text Read/Write. A string holding the text of the 
words recognised (i.e., a shorthand for contents of the 
text attribute of the highest level element in -he SKL 
recognition return in recoResult In a read mode. In 
write mode, a string can be assigned, which will then 
be pa,xsed as if the string corresponds to the 
recognition result. The write mode allows extension of 
the markup tags and processing thereof to other 
components or applications on the client device. The 
string can be obtained from the "smax" message object. 

> status: Read-only. Status code returned by the 
recognition platform. Possible values are 0 for 
successful recognition, or the failure values -1 to -4 
(as defined in the exceptions possible on the Start 
method (section 2.3.1) and Activate method {section 
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2.3.4)), and statuses -11 to -15 set on the reception 
of recognizer events (see 2.4). 

2,3 Object m&thods 

Reco activation and grammar activation may be controlled 
using the following methods in the Reco's DOM object. With 
these methods, uplevel browsers can start and stop Reco 
objects, cancel recognitions in progress, and activate and 
deactivate individual grammar top-level rules {uplevel 
browsers only) * 

2.3,1 Staxt 

The Start method starts the recognition process, using as 
active grarcumars all the top-level rules for the recognition 
context which have noz been explicitly deactivated. 

Syntax ; 

Object .Start { ) 
Return value: 

None. 
Exception: 

The method sets a non-zero status code and fires 
an onNoReco event when fails/ Possible failures 
include no grammar (reco status - -1), failure to 

r 

load a grammar, which could be a variety of 
reasons like failure to compile grammar, non- 
existent URI (reco status = -2), or speech 
platform errors (reco status = -3). 

2.3,2 Stop 

The Stop method is a call to end the recognition process. 
The Reco object stops recording audio, and the recognizer 
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returns recognition results on the audio received up to the 
point wheie recording was stopped. All the recognition 
resources used by Reco are released, and its grammars 
deactivated* (Note that this method need not be used 
explicitly for typical recognitions in automatic mode, since 
the recognizer itself will stop the reco object on endpoint 
detection after recognizing a complete sentence.) If the 
Reco has not been started, the call has no effect* 

Syntax : 

Object. Stop { ) 
Return value; 

None . 
Exception : 

None - 

2.3.3 Cancel 

The Cancel method stops the audio feed to the recognizer, 
deactivates the grammar and releases the recognizer and 
discards any recognition results. The browser will disregard 
a recognition result for canceled recognition. If the 
recognizer has not been started, the call has no effect:. 

Syntax : | 

Obj eot . Cancel { ) 
Return value : 

None . 
Exception: 

N on e . 
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2.3*4 Activate 

The Activate method activates a top-level rule in the 
context free grammar (CFG) . Activation must be called before 
recognition begins, since it will have no effect during a 
'Started' recognition process. Note that all the gramar 
top-level rules for the recognition context which have not 
been explicitly deactivated are already treated as active, 

Syntax : 

Object .Activate (str^ame) ; 
Parameters : 

o strName: Required. Rule name to be activated. 
Return value ; 

None , 
Exception : 

None . 

2.3.5 Deactivate 

The method deactivates a top-level rule in the grammar. 
If the rule does not exist, the method has no effect. 

Syntax: 

Object, Deactivate (strName) ; 
Parameters : 

o strName: Required. Rule name to -be deactivated. An 

empty string deactivates all rules. 
Return value 

None. 
Exception 

None r 
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2 . 4 K&ao events 

The Reco DOM object supports the following events r vrhose 
handlers may be specified as attributes of the reco element. 

2.4.1 onBeco : 

This event gets fired when the recognizer has a 
recognition result available for the browser. For recos 
in automatic mode, this event stops the recognition 
process automatically and clears resources (see 2.3.2), 
OnReco is typically used for programmatic analysis of 
the recognition result and processing of the result 
into the page. 



Syntax : 



Inline HTML 


<Reco onReco ^"handl&r" > 


Event property 


Object , onReco - handler; 
Obj ect . onReco = 
GetRef C hand! ex") ; 



Event Object Info; 



Bubbles 


No 


To invoke 


User says something 


Def s u 1 1 
action 


Return recognition result object 



Event Properties: 

Although the event handler does not receive properties 
directly, the handler can query the event object for 
data (see the use of the event object in the example 
below ) ♦ 
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Example 

The following XHTML fragment uses onReco to call a 
script to parse the recognition outcome and assign the 
values to the proper fields. 

<input name="txtBoxOrigin" type- "text" /> 
<input name^ r 'txtBoxDest" type^'text" /> 
<reco orReco^'processCityRecognition {)"/> 

<grammar src» w /grarranars/cit ies . xml" /> 
</reco> 

<script><! [CDATA; 

function processCityRecognition 0 { 

smlResult = event . srcEloment . xecoResul t ; 

origNode ^ 

smlResult . selectSingleNode { M //origin_city") ; 

if (origNode ! = null) txtBoxOrigin • value = 

origNode . text:; 

destNode = 

smlResult, selectSingleNode ( V/dest_city") ; 

if (des tNode F- null) txtBoxDest . values = 

destNode . text ; 

) 

] ] ></script> 
2,4,2 onSilence : 

onSilence handles the event of no speech detected by the 
recognition platform before the duration of tine specified 
in the init ialTimeout attribute on the Reco (see 2.2.1). 
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This event cancels the recognition process automatically for 
the automatic recognition mode. 



Syntax: 



Inline HTML 


<reco onSilence="2?andler' r 


Event property {in 
j^CMAScript ) 


Object .onSilence - handler 
Object . onSilence = 
GetRef Chandler") ; 



Event Object Info: 



Suboles 




To invoke 


Recognizer did not detect speech within 
the period specified in the 
ir.itialTimeout attribute. 


Default 
action 


Set status - -11 



El vent Properties : 

Although the event handler does not receive properties 
directly, rhe handler can query the event object for 
data . 



2,4.3 onTiraecru t 

ozTirneout handles two types of event which typically reflect 
errors f>rom the speech platform. 

* It handles the event thrown by the tags interpreter 
which signals that the period specified in the. xtime 
attribute (see 2.2.1) expired before recognition was 
completed. This event will typically reflect problems 
that could occur in a distributed architecture. 
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♦ -It also- handles (ii) the event thrown by the speech 
recognition platform when recognition has begun bu- 
processing has stopped without a recognition within the 
period specified by babbleTimeout (see 2, 2 A). 

This event cancels the recognition process automatically. 



Syntax : 



Inline HTML 


<reoo onTimeout= // £andier" 


Event property (in 
ECMAScript ; 


Object . onTimeOut = handler 
Ob j ect . onTineOut 
Get Re f ("handler") ; 



^vent Object Info: 



Bubbles 


No 


To invoke 


Thrown by the browser when the period 
set by the isaxtime attribute expires 
before recognition is stopped. 


Default 
action 


Set reco status to -12 . 



Event Properties ; 

Although the event handler does not receive properties 
directly, the handler can query the event object for 
data. 



2,4.4 onHoHeco : 

onNoReco is a handler for the event thrown by the speech 
recognition platform when it is unable to return valid 
recognition results. The different cases in which this may 
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happen are distinguished by status code. The event stops the 
recognition process automatically. 



Syntax: 



Inline HTML 


<Reco onNoReco -"handler" > 


Event property 


Object- onNoReco = handler; 
Object . onNoReco = 
GetRef pbandler") ; 



Svent Object In£o: 



Bubbles 


No 


To invoke 


Recognizer detects sound but is unable 
to interpret the utterance, 


Default 

action 


Set status property and return null 
recognition result. Status codes are set 
as follows: 

status -13: sound was detected but no 
speech was able to be interpreted; 
status -14: some speech was detected and 
interpreted but rejected with 
insufficient confidence (for threshold 
setting, see the reject attribute in 
2.2,1). 

status -15: speech was detected and 
interpretedr but a complete recognition 
was unable to be returned between the 
detection of speech and the duration 
specified in the babbleTimeout attribute 
( see 2.2.1) - 
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Event; Properties: 

Although the event handler does not receive properties 
directly, the handler can query the event object for 
data . 

3 Prompt 

The prompt element is used to specify system output. Its 
content may be one or more of the following: 

• inline or referenced text, which may be marked up with 
prosodic or other speech output information; 

• variable values retrieved at render time from the 
containing document; 

• links to audio files. 

Prompt eleinen-5 may be interpreted declaratively by 
downlevel browsers (or activated by SMIL commands), or by 
object methods on uplevei browsers. 

3,1 Pxcunpt G&nt&nt 

The prompt element contains the resources for system output, 
either as text or references to audio files T or both. 

Simple prompts need specify enly the text required fox 
output, eg: 

<prompt id="Wel come"> 

Thank you for calling ACME weather report. 
</proinpt> 
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This simple text may also contain further markup of any of 
the kinds described below. 

3.1.1 Speech Synthesis markup 

Any format of speech synthesis markup language can be used 
inside the prompt element. (This format may be specified in 
the 'tts' attribute described in 3.2.1.) The following 
example shows tfixt with an instruction to emphasize certain 
words within it: 

<prompt id<="giveBalance"> 

You have <emph> five dollars </emph> left in your 

account > 

</prorapt> 

3.1.2 Dynamic content 

The actual content of the prompt may need to be computed on 
the client just before the prompt is output. In order to 
confirm a particular value, for example, the value needs to 
be dereferenced in a variable. The value element may be 
used for this purpose. 

Value "ETlement: 

value: Optional. Retrieves the values of an element in the 
document - 

Attributes : 

* targetElement: Optional. Either href or ta rgetElement 
must be specified. The id of the element containing the 
value to be retrieved. 

» targetAttribute: Optional. The attribute of the element 
from which the value will be retrieved- 
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* haref: Optional . The URI of an audio segment, href will 
override targetElement if both are present. 

The targetElement attribute is used to reference an element 
within the containing document. The content of the element 
whose id is specified by tar get Element is inserted into the 
text to be synthesized. If the desired content is held in 
an attribute of the element,, the targetAttribute attribute 
may be used to specify the necessary attribute on the 
targetElement. This is useful for dereferencing the values 
in HTML form controls, for example. In the following 
illustration, the M value" attributes of the "txtBoxOrigin" 
and "txtBoxDest" elements are inserted into the tex: before 
the prompt is output 

<prompt id="Conf irm"> 

Do you. want to travel from 
< value targetElement="tKtBoxOrigxn'' 
target Attribute-" value" /> 
tc 

<value targetEie:sent="txtBoxDest" 
targetAttribute="value^ /> 
? 

</prompt> 
3.1.3 Audio files 

The value element may also be used to refer to a pre- 
recorded audio file for playing instead of, or within r a 
synthesized prompt. The following example plays a beep at 
the end of the prompt: 

<prompt> 
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After the beep, please record your message, 
lvalue href ="/wav/beep. wav" /> 
< /prompt > 

3.1,4 Referenced prompts 

rnstead of specifying content inline, the src cat tribute may 
be used with an empty element to reference external content 
via URI r as in: 

<prompt id=*"Welcome" s r c- V ACMBWea the rPrompts#Vvel come" 

/> 

The target of the src attribute can. hold any or all of the 
above content specified for inline prompts. 

3,2 Attributes; and properties 

The prompt element holds the following attributes (downlevel 
browsers) and properties (down level and uplevel browsers) > 

3.2.1 Attributes 

* tts: Optional. The markup language type for text-to- 
speech synthesis. Default is "SAPI 5". 

* src: Optional if an inline prompt is specified. The 
URI cf a referenced prompt (see 5*1 -4). 

* barged: Optional, Integer. The period of time in 
milliseconds from start of prompt to when playback 
car) be interrupted by the human listener. Default is 
infinite, i.e., no bargain is allowed. Bargein-0 
allows immediate bargein . This applies to whichever 
kind of barge-in is supported by platform. Either 
keyword or energy-based bargein times can be 
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configured in this way, depending on which is 
enabled at the time the reco is szaxted, 
» prefetch: Optional, A Boolean flag indicating 

whether the prompt should be immediately synthesized 
and cached at browser when the page is loaded. 
Default is false, 

3,2.2 Properties 

Uplevel browsers support the following properties in the 
prompt's DOM object* 

* bookmark: Read-only. A string object recording the text 
of the las~ synthesis bookmark encountered, 
status: P,iB&d~only> Status code returned by the speech 
platform. 

Innertext: Read-only. This property will provide the 
text transcription of the prompt, which would be sent 
to the synthesizer. For instance, if a prompt comprises 
playing an audio wave file, this property provides a 
~ext version of that prompt {often stored with the 
audio wave file), which can then be displayed, or 
otherwise used, for example, by providing the text 
version of the prompt to a component or application 
running on the client device. The innertext property 
can also be used to provide text versions of prompts 
containing dynamic content. 

3 . 3 pTnjTj^rt: m&thods 

Prompt playing may be controlled using the following methods 
in the prompt's DOM object. In this way, uplevel browsers 
can start and stop prompt objects, pause and resume prompts 
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in progress, and change the speed and volume of the 
synthesized speech. 

3.3.1 Start 

Start playback of the prompt. Unless an argument is 
given, the method plays the contents of the object. 
Only a single prompt object is considered ^started' at 
a given time, so if Start is called in succession, all 
playbacks are played in sequence* 

Syntax: : 

Object. Start ( [strText] ) ; 
Parameters : 

o storTesct: the text to be sent to the synthesizer. 
If present, this argument overrides the contents 
of the object. 

Re turn value : 
None. 

Exception : 

Set status - -1 and fires an onComplet e event if 
the audio buffer is already released by the server. 

3.3.2 Pause 

Pause playback without flushing the audio buffer. This 
method has no effect if playback is paused or stopped. 

Syntax : 

Object , Pause ( ) ; 
Return value : 

lvone. 
Exception : 

None. 



(118) 



WW 2 003-131772 



3.3.3 Resume 

Resume playback without flushing the audio buffer. This 
method has no effect if playback has not been paused. 

Syntax : 

Ob ject. Restore ( ); 
Return value: 

None . 
Exception : 

Throws an exception when resume failts- 

3.3.4 Stop 

Step playback, if not already, and flush the audio 
buffer. If the playback has already been stopped,, the method 
simply flushes the audio buffer* 

Syntax: : 

Obj ect, Stop ( ) ; 
Return value: 

Hone, 
Exception: 

None. 

3.3.5 Change 

Change speed and/or volume of playback. Change may be 
called during playback. 

Syntax: 

Object. Change (speed, volume) ; 
Parameters ; 

o speed; Required. The factor ~o change. Spood-2 . 0 
neans double the current rate, speed^O.5 means 
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halve the current rate, speed=0 maans to restore 

the default value, 
c voliarae: Required. The factor to change. Volume-2 . 0 

means double the current volume, volume -0.5 means 

halve the current volume, volume ^0 means to 

restore the default value* 
Return Aralu^ r 

None . 
Exception: 

None . 



3 . 3 . G Prompt control ©sample* 

The following cnanple shows how control of the prompt using 
the methods above might be authored for a platform which 
docs not support a keyword barge-in mechanism. 

<htiru> 

<tl~le>Frontpfc oontrol</tit ic> 
<head> 
<script> 

< ! 

function checkKWBargein ( ) I 

news . change (I. 0, 0.5); // earn down the volume 
while verifying 

if (keyword. text ~ un ) \ // result is belotf 

threshold 

news. change (1.0, 2.0); // restore the volume 
keyword. Star" () ; // restart the recognition 
} else 1 

news.StopO; // keyword detected! Stop the 

prompt 

// Do whatever that is necessary 

) 

} 

// 

</script> 
cscript for-" window" event^'onload 1 ^ 

< \ — 
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news. Start () ; keyword . Start { } ; 

// 

</script> 

</head> * ■ 

<body> 

<prornpt id="news' 1 hargeAr.^ 1f 0 lf, > 

Stocks turned in another lackluster performance Wednesday 

as investors received little incentive to make any big moves 

ahead! of next week's Federal Reserve xaeeting. The tech-heavy 

Hasdaq; Composite Index dropped 42.51 points to close at 

2156 .26. The Dow Jones rnd.nsfcr±al Average fell 17„05 points 

to 10866.46 after an early -afternoon zrally failed. 

- <! — 
</prorript> 
<reco id= "keyword" 

reject="70 T ' 

on^co™" cliackKWBargeiD <) " > 

<grarmar grc^ htfcp : //denali/news bargein grammar . xml /> 

</reco> 
</body> 
</html> 

3.4 Prompt events 

The prompt DOM object supports the foLlowir,g events, whose 
handlers rr.ay be specified as attributes of the prompt 
element . 

3.4.1 onBookmark 

Fires when a synthesis bookmark la encountered. The 
event does not. panse the playback. 



Syntax : 



Inline HTML 


<prorr.pt onBootork= // handler" 
...> 


Event property 


Object . onBookmark ™ har.dl&r 
Object . onBoooark - 
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GetRef {"handler") ; 



Event: Object: Info: 



Bubbles 


No 


To invoke 


A bookmark in the rendered string is 
encountered 


Default, 
action 


Returns the bookmark string 



Even fc Pr-or>e rties: 

Although the event handier does not receive properties 
directly, the handler cam query the event object for 
data. 

3.4.2 onBargein : 

Fires when a user' 5 barge-in event is detected. (No Le 
that determining what constitutes a barge-in event, eg 
energy detection or keyword recognition, is up to the 
platform.) A specification of this event handler does 
not automatically turn the barge -in on. 



Syntax : 



Inline KTMT, 


<prompt onBargein-"handier" 
...> 


Event property 


Ob ject . onBargein - handler 
Object . onBargein = 
^et Ke £ ( ' v hand! er" ) ; 



Event Object Info: 



Bubbles 


Wo 


To invoke 


A bargein event is encountered 
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Default 


None: 


action 





Event jPr-opeartxes: 

Although the event handler does not receive properties 
directly r the handler can. query the event object fox 
da tE . 

3,4.3 onComplete : 

Fires when the prompt playback reaches the end or 
exceptions (as defined above) are encountered. 



Inline HTML 


<proirtpt onConaplete^'handler'' 
...> 


Event property 


Object, onComplete = handler 
Object. onComplete = 
GetRef Chandl&r") ; 



Event; Object Xafo; 



Bubbles 


No 


To invoke 


A. prompt playback completes 


Default 
action 


Set status - 0 if playback completes 
normally, otherwise set status as 
spec i f i ed above . 



Event Properties : 

Although the event handler does not receive properties 
directly, the handler can query the event object for 
data . 
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3.4.4 XJeringr bookmajcke and events 

The following example shows bow bookmark events can be used 
to determine the semantics of a user response - either a 
correction to a departure city or the provision of a 
destination city - in terms of when bargain happened during 
the prompt output. The onBargein handier calls a script 
which sets a global 'mark' variable to the last bookmark 
encountered in the prompt, and the value of this 'mark' is 
used in the reco's postprocessing function ('heard'} to set 
the correct value. 



<script><! [CDATA[ 
vax rnarx; 

function interrupt { ) { 

mark - event * srcElement . bookmark; 

} 

function ProcessCi tyConf irm () { 

confirm. stop () ; // flush the audio buffer 
if (mark — u mark_ origin_cit y" ) 
txtBoxOr i gin . value ~ 
event * srcElement . text ; 

else 

txtBoxDest . value » 
event , srcElement . text; 
} 

]]></script> 
<body> 

< input name= ;/ txt5cxOrigii3' r value=" Seat tie" 
type-="text"/> 

<input Bame-"txt9oxDest' / t ype~" text n /> 

<prompt id~"conf irm" onBargein="interrupt ( ) rt 
bargein="0"> 

Frons <bookmark mark^^mark^origin^cit^ /> 
<value target£lement="orgin" 
targetAttiibute^^alue" />, 

please say <bookmark mar k^"mark__dest_city" /> the 
destination city you want to travel to. 
</prompt> 

<reco onKeco="Proces5CityConf irm ( ) " > 

<grammar src-" /grin/ 1033/cities . xml" /> 
</reco> 
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</body> 
4 DTKF 

Creates a DTMF recognition object. The object can foe 
instantiated using inline markup language syntax or in 
scripting- lichen activated, DTMF can cause prompt object to 
fire a barge-in event. It should be noted the tags and 
eventing discussed below with respect to DTMF recognition 
and call control discussed in Section 5 generally pertain to 
interaction o.etween the voice browser 216 and media server 

4.1 Content 

» dtmf grammar : for inline grammar. 

• bind: assign DTMF conversion result to proper field. 
At-brxbiate-s ; 

• targetElement: Required- The elenent to which a partial 
recognition result will be assigned to (cf. same as in 
W3C SMIL 2.0 ) . 

• ? targstAttrxbtrbe : the attribute of the targe: element to 

which the recognition resulz will be assigned to (cf. 
same as in SMIL 2.0) . Default is Value". 
» test: condition for the assignment. Default is true. 

Example 1: map keys to text 

<input type«"text" name*="city"/> 
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<DTMF id~"ci-y_choice'' timeout-' 7 20 00 " nuir;Digits= rr l"> 
<dtmf grammar> 

<key ^-aiue-"l">Seattle</key> 

<key value^"2">Boston</key> 
</citmf grammar > 

<bind targetEle:ment-"city" targetAttribute-"value' 

/> 

</DTMF> 

When w ci3y_choice" is activated, ^Seattle" will be 
assigned to the input field if the user presses I, 
^Boston" if Z, nothing other-wise. 

Example 2: How DTMF can be used with multiple fields. 

<input type-"texr" name«"area_code" /> 
<±nput type^tex-^ name=^phone_nnBber /r /> 
<DTMF id="areacode" nurtiDigits=*"'3" 
onReco*"extenGion. Activate () "> 

<bind targetElement="area__code" /> 
</DTMF> 

<DTMF id-^extension" numDigits«"7"> 

<bind targetElement-"phone^nuiubcr" /> 

</DTMF> 

This example demonstrates how to allow users entering 
into multiple fields. 

Example 3: How to allow both speech and DTMF inputs and 
disable speech when user starts DT|MF* 

<input type= "text'' narxie-''credit_card_nu:mber" /> 
<prompt onBookmark«' r ctTnf -Start () ; speech . Start () " 

bargein-"C"> 
please say <:bookmark name^ f/ starting" /> 
or enter your credit card number now 
</prompt> 

<DTMF id="dtir.f" escape^"#" length="l 6" 
interdigitTiirieou-=''2 00G" 

onkeypress^'-'speech - Stop ( ) "> 

<bind targetEleirtent^credit^card^urnber 7 ' /> 
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</DTMF> 

<reco id— "speech" > 

<graimrnar src^'Vgrm/1033/digits . xml" /> 
<bind targetE2ement-"credit_card_number'' /> 

</reco> 



4.2 Attributes and pxop&xtlBss 

4 > 2 ♦ 1 Attributes 

• dtxmf grammar : Required. The URI of a DTMF grammar. 

4.2,2 Properties 

DTMFgrammar Read-Write. 

An XML DOM Node object representing DTMF to string 
conversion matrix (also called DTMF grammar) . The 
default grammar is 

<d t in f g r amma r > 

<key value<="0">0</key> 
<key value="l">K/key> 

<key valued 9">9</key> 
<key value="*">*</key> 
<key value= v #">K/3cey> 
</dtmf grammar > 

flush 

Read-write, a Boolean flag indicating whether to 
automatically flush the DTMF buffer on the underlying 
telephony interface card before activation. Default is 
false to enable type-ahead. 



escape 

Read-Write. The escape key to end the DTMF reading 
session. Escape key is one key. 
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* munDigits 

Read-Write. Number of key strokes to end the DTMF 
reading session. If both escape and length are 
specified, the DTMF session is ended when either 
condition is met. 

dtntfResnXt 

Read-only string, storing the DTMF keys user has 
entered. Escape is included in result if ryped, 

• text 

Read-only string storing white space separated token 
string, where each token is converted according to DTMF 
grammar. 

initialTimeout 

Read-write. Timeout period for receiving the first DTMF 
Jceystoke, in milliseconds. If unspecified, defaults to 
the telephony platform' s internal setting. 

intexrdigitTimeotit 

Read-Write. Timeout period for adjacent DTMF keystokes, 
in milliseconds . If unspecified, defaults to the 
telephony platform's internal setting. 

4.3 Ohj&ot method* : 

4.3.1 Start 

Enable DTMF interruption and snart a DTMF reading 
session. 
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Syntax: 

Object .Start < ) ; 
Return value: 

None 
Exception : 

None 

4.3.2 Stop 

Disable DTMF. The key strokes entered by "he user, 
however, remain in the buffer. 

Syntax ; 

Object. Stop { ) r 
Return value: 

None 
Exception : 

None 

4.3.3 Flush 

Flush the DTMF buffer. Flush can not be called during a 
DTMF session . . 

Syntax: 1 

Object , Flush ( } ; 
Retuirn value: 

None 
Exception: 

None 
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4, 4 Bvants 

4.4,1 onkeypress 

Fires when a DTOF key is press. This overrides the 
default event inherited from the HTML control, ^hen 
user hits the escape key, the onRec event fires, not 
onKeypress . 



Syntax : 



inline HTML 


<DTMF onkeypress=' / iiaj!dler" „> 


Event property 


Object * onkeypress ** handler 
Object . on keypress = 
GetKef ("handler") ; 



Event Object Info: 



Bubbles 




To invoke 


Press on the touch-tone telephone key 


Default 
action 


Returns the key being pressed 



Event Properties : 

Although the event handler does not receive properties 
* directly, the handler con query the event object for 
da ta . 



4.4.2 onReco 

Fires when a DTMF session is ended. The event disables 
the current DTMF object automatically. 
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Syntax : 



Inline HTML 


OTMF onReco=" handler" „> 


Event property 


Object .onReco » handler 
Object , onReco - 
GetRef ["handl&r") ; 



Event Object In£o: 



Bubbles 


No 


To invoke 


User presses the escape key or the 
number of key strokes meets specified 
value , 


Default 
action 


Returns the key being pressed 



Event Properties : 

Although the even- handler does not receive properties 
directly, the handler can query the event object for 
data . 

4,4,3 onTimeou t 

Fires when no phrase finish event is received before 
time out. The event halts the recognition process 
automatically - 



Syntax : 



Inline HTML 


<DTMF onTimeout=" handler'" „> 


Event property (in 
ECMAScript) 


Obj ect - onTimeout = handler 
Object . onTimeout = 
GetRez {"handler") ; 



Event Object Info r 
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Bubbles 



No 



To invoke 



Mo DTMF key stroke Is detected wichin 
the timeout specified. 



Default 



None 



action 



Event Properties ; 

Although the event handler does no: receive properties 
directly, the handler can query the event object for 
data . 

5 CallControl Object 



Represents the telephone interface {call, terminal, and 
connection) of the telephone voice browser. This ooject is 
as native as win cow object in a GUI browser. As such, the 
lifetime of the telephone object is the same as the browser 
instance iLsei£. A voice browser for telephony instantiates 
the telephone object, one for each call. Users don't 
instantiate or dispose- the object. 

At this point, only features related to fizsi-party call 
controls are exposed through this object. 

5 , 1 Properties 
address 

Read-only. XML DOM node object. Implementation 
specific. This is the address of the caller. For PSTN, 
may a combination of AN I and ALL For VoIP, this is the 
caller's IP address. 
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r ing sBef o r&Ans wer 

Number of rings before answering an inconiing call. 
Default is infinite, meaning the developer must 
specifically use the Answer; ) method below to answer 
the phone call. When the call center uses ACD to queue 
up the incoming phone calls, this number can be set to 
0. 

5.2 M&tha&s 

Note: all the methods here are synchronous. 

5,2-1 Transfer 

Transfers the call. For a blind transfer, the system 
may terminate the original call and free system 
resources once the transfer completes. 

Syntax : 

telephone. Transfer {strText) ; 
Parameters ; 

o strText: Required. The address of the intended 

receiver. 
Return value : 

None . 
Exception : 

Throws an exception when the call transfer fails- 
e.g., when end party is busy, no such number, fax or 
answering machine answers. 

5.2.2 Bridge 

Third party transfer. After nhe call is transferred, 
the browser may release resources allocated for the 
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call. It is up to the application to recover the 
session state when the transferred call returns using 
strUID* The underlying telephony platform may route the 
returning call to a different browser. The call can 
return only when the recipient terminates the call. 

Syntax : 

telephone- Rri rige (st.rText, strUID, [imaxTine] ); 
Parameters : 

o strText: Required. The address of the intended 
receiver. 

o strUXD: Required- The session ID uniquely 

identifying the current call. When the transferred 

call is routed back, the sxtUID will appear In the 

address attribute, 
o ijnaxTime : Optional, Maximum duration in seconds of 

the transferred call. If unspecified, defaults to 

plat f orrri-internal value 
Return value : 

None. 
Exception ; 

Klone - 

i 

5.2.3 Answer 

Answers the phone call- 

Syntax : 

telephone. Answer { ) ; 
Return value: 
None . 
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Exception : 

Throws an exception when there is no connection. 
No onAnswer event will be fired in this case. 

5.2.4 Hangup 

Terminates the phone call. Has no effect if no call 
currently in progress. 

Syntax : 

telephone . Hangup { ) ; 
Return value ; 

None , 
Exception : 

None. 

5.2.5 Connect. 

Starts a first-party outbound phone call. 

Syntax : 

telephone .Connect [strText, [iTiicieout] ) ; 
Parameters ; 

o strText: Required. The address of the intended 
receiver . 

o illmeout: Optional, The time in milliseconds 
before abandoning the attempt. If unspecified, 
defaults to platform-internal value. 

Return value: 
None . 

Exception : 
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Throws an exception when the call cannot be 
completed, including encountering busy signals or 
reaching a FAX or answering machine {Note: 
hardware may not support this feature) . 

5*2.6 Record 

Record user audio to file. 

Syntax : 

telephone, Record (url, endSilence, [its ax Timeout] , 
[initialTirr.eout] } ; 
Parameters : 

o url: Required. The url of the recorded results. 

o endSil^nce ; Fleguired. Tine in milliseconds Lo stop 

recording after silence is detected, 
o uia^Timecut: Optional. The rnaxiraurr. tixne in seconds 

for the recording. Default is platform-specific, 
o iuitialTimeout: Optional. Maxirnum time (in 

milliseconds) of silence allowed at the beginning 

of a recording. 
Return value: 

None . 
Exception : 

Throws an exception when the recording can not be 
written to the url. 



5.3 Erent Haji ell ers 

App developers using telephone voice browser may implement 
the following event handlers. 
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5 . 3 . X onlnconting ( ) 

Called when the voice browser receives an incoming 
phone call. All developers can use this handler to read 
caller r $ address and invoke customized lea Lures before 
answering the phone call . 



5.3.2 onAn3wer( } 

Called when the voice browser answers an incoming phone 
call. 



5.3.3 onHaagup ( ) 

Called when user hangs up the phone. This even:. Is ttOT 
automatically fired when the program calls the Hangup 
or Transfer methods . 



5. 4 Example 

This example shows scripting wired to the call control 
events to manipulate the telephony session. 



<HTML> 
<KEAD> 

<TTTL,B>I»agori Page</TIIXE> 

</HEAD> 

<SCKIPT> 
var focus; 

function RunSpeech 0 ( 

if (logon- user. value — " ") { 
focus= "user" p 

p_uid. Start { J ; g_login . S~art (} ; dtmf - Starr. ( ) ; 

return; 
} 

if ( logon. pass . value ™ nrr ) ■ 
focus="pin"; 
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pjpin.StaxtO ; g^login. Start () ; dtmf. Start {) ; 

return; 
} 

p_thank . Start ( } ; logon . submit ( } ; 

3 

function login_reco() { 

res - event . srcElercent. recoResul^ ; 
pNoda - res, selectSingleNode( V/uid'M ; 
if (ptiode 1= null) 

logon. user . value pNode *xral; 
pNode ** res. selectSingleHode ( " //password" ) ; 
if (p^ode ! = null) 

logon. pass, value pKode,xml; 

} 

function dtnrf_reco ( ) { 

res - event. srcElement . dtmf Result ; 
if (focus ~ "user") 

]ogor..user.yalne =* res; 

else 

logon. pin. value = cesr 

} 

</SCKIPT> 

< SCRIPT for-^callControl" event= n onI ncoming 1 ' > 
< i — 

// read address r prepare customized stuff if any 
caliControl .Ms»er ( ) ; 

// 

</SCRIPT> 

<SCRIPT for^'^allControl" event-" onOff hook" > 

<} — 

p_„-nain. Start () ; c_login . Start ( } ; dtmf - Start ( ) ; 
focus- n us er " ; 
// 

</3CRIPT> 
<SCRI?r for^'Vindow" e?ent="onload ,, > 

<! — • 

if (logon, user ."value !- ) I 
p retry - Srart ( ) ; 
logon- user .value = nrt ; 
logon . pass .value = " " ; 
checkFields 0 ; 

) 

// 

</SCRIPT>- 
<BODY> 

<reco id= n g_login" 

onReco""login_x©co {) ; runSpeech () " 
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rimeout^"5000' f 

onTiir.eout~ 11 p_miss . Start () ; KunSpeecn () " > 

<gramniar 

src^htfcp : //3co3can©el /etr-adedezno/speechonXy/ login .3aatX/> 
</ reco > 
<dtraf id==" dtiaf" 
escape-^ 

onkeypress«*"g_login, Stop<) ; JT 
onEeco- r 'dtiiif_xeco() ; .Run Speech () w 
interdigitTlmeout- r 5000" 

ontimeout^'dtraf .Flusn () ; p__miss . start <) ;RunSp^eok () " /> 

<proxapt .;. d= "pjrnain n >Pl©aso say your us©r X D and pin 

n uiaber < / p r orap t > 
<proiapt id="p_uid">Pl^as© Ju&t stay your user I D</proinpt> 
<pr ompt id~ n p_j>±n">Flease just say your pin number</prompt> 
<prompt id="pjniss v > Sorry , I missed tnat:< /prompt > 
<promp~ .i d^"p_thanlc">Than3c you. Please wait while X verify 

your identity</prcmpt> 
<proirpt id="p_r<stry Tr >Sorry r your user I D and pin number do 

not mafcch</proitpt > 

<H2>I>ogin</H2> 
<f orm id-*lo^oxi"> 

uid: <1npt;" name-"user" type -"text" 
onChange~"runSpeech {> " /> 

PIN: <input n&ir.e-^pass" type~ Tt pas3Word ir 
onChange^ n Run£peech {) " /> 

</z orm> 
</B0DY> 
</HrHL> 



6 Controlling dialog^ flow 



5,1 Cfexng- HTML and script to linpl exaent dialog- fioTr 

This example shows how to implement a simple dialog flew 
which seeks values for input boxes and offers context- 
sensitive' help for the input. It uses the title attribute on 
the HTML input nechani sms (used in a visual browser as a 
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^tooltip" mechanism) to help form the content of the help 
prompt . 



<html> 

<title>Coate;Kt ScnsitxTe Help< /" Lt.le> 
<head> 

<" script > var focus; 

function Run Speech () { 

if {trade.. stock. value — { 
focus= ,f trade* stock 11 ; 
p_stock + Start () ; 
return ; 

} 

if (trade. op. value — ,1>T ) i 
focus—" trade « op " ; 
p_op. Start {> ; 
return ; 

> 

repeat above for all fields 
trade . submit () ; 

J 

function handle {) { 

res — event . srcElernent. recoRe.£i*lt ; 
if (res.text — "help") K 

texrfc « "Please just say" ; 

text t= document. all [focus] .title; 

p help , S tar t ( text } ; 
} else { 

// proceed with value assignments 
} 

} 

</ script > 
</head> 
<body> 

<promot id-"p_heip r> cnConplete= ,J checkFileds (> Tt /> 

<pronipt id- n p_stook ?t onComplete-"g_stoc3c. Start () n >Please say 

the stock namec/prompt > 

<prompt Ld-"p_op n onCorapicto*"g - _op. Start C) ">Do you want to 
buy or sell< /prompt > 

<prompt id-'V___<£uantity'> onComplet e- ¥t g_qnantity . Start () 11 >How 
many shares? </prorap-> 

<prompt id-"p_price n cnCompl ete- rr g_pric©-. Start {) " >What' s the 
priae</prampt> 
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<reco id= ?, g_stock v onReco— "handle () ; checkFields {) " > 
<granimar src= M , /g^stock.xml" /> 

</ reco > 

<reco id="g_op" onReco="Handle () ; checkFields { ) " /> 

<grantmar' s rc= " . / g_op . xml n /> 
</ reco > 

<reco id-"g_quantiiiy JT onReco - M handle {) ; checkFields {) " /> 
<grammar s rc- ,T . /g^quant . xml " /> 

</ reco > 

<reco id="gj>rice n onReco="handle {) ; checkFields {} T1 /> 
<graramar s r c= ,T . /g^quant . xml " /> 

</ reco > 



<form id™" trade"> 

<input name="stock tr title- "stock name" /> 
<select name-"op" title= tr buy or ssell"> 

<option value^'^uy" /> 

<option value="sell" /> 

</sel ect> 

<Input name="qTxantity" tit le^ ir miini>er of shares" /> 

<input narne= n price T ' title-^price" /> 
</form> 
</body> 
</html> 

Using- SMIZ 

The following example shows activation of prompt and reco 
elements using SMIL mechanisms , 



<html xmlns : t-^urn: scheimas-rr.icrosof t-com: time" 

xmlns : sp="urn : schemas-microsof t-com; speech 1T > 

<head> 
<style> 

.time { behavior: url { #def aul t# time2 ) ; ) 
</style> 
</head> 
<body> 

<input naroe^ T, txtBoxOrigin n type ss "text , 7> 
<input name= n txtBoxDe3t " t ype^" text" /> 
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<sp:prompt class= n tiiue" z :begin="0 M > 

Please say the origin and destination cities 
</sp: prompt> 

<t:par t :begin= ,r time . end ,r t : repeatCount^ n inde£ initely" 
. <sp:reco class^'t ime" > 

<grammar src=' f . /city . xml" /> 
<bind taxgecElercent-"txtBoxOrigin " 

value-" //origin_ciT:y" /> 
<bind targe LElement- n txtBo^Dest n 
test-"/sml/dest_city [^confidence $gt$ 40]" 
value-V/des~_city" /> 
</sp: reco> 
</t :'par> 

</bociy> 
</htmi> 



7 SMEX (Message) element/object 



3MEX, short for Simple Messaging Exchange /EXtension r is an 
object that communicates with an external component or 
application on the platform of the client device. It can be 
embedded into an XML or similar markup based document as an 
element with the tag name <smex>. Exemplary usages of -the 
messaging object may include logging and telephony controls 
The object represents the extensibility of markup based 
recognition and prompting as it allows new f undtionality be 
added through messaging. 

Upon instantiation, the object is directed to establish an 
asynchronous message exchange channel with a platform 
component or application through its configuration 
parameters or attribute specifications* It has a string 
property whose content is sent to the platform component or 
application whenever the property is the recipient of an 
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assignment operation (i.e., lvalue). Similarly r it also has 
a property of KML DOM Node type that holds the message 
received from the platform component or application. The 
message object sends an event whenever it receives a 
plat term message. Since its basic operations are 
asynchronous, the object also has a built-in clock for the 
application developers to manipulate timeout settings. 

The message or smex object is agnostic to the means of 
coTOiuni cations. However, in one embodiment , the smex object 
has the same life span as ordinary XML or markup elements, 
namely, the smex object will be destroyed when its hosting 
document is unloaded. While in many cases, the smex object 
can perform automatic clean-up and release communication 
resources when it is unloaded, there might be use cases 
(e.g, r call controls) in which a persistent communication 
link is desirable across markup pages. For those cases, the 
architecture places the responsibility of relinquishing the 
allocated resources {e.g. close the socket) on the 
application developers. 

The smex object is neutral on the format (schema) of 
messages. In seme embodiments/ it may desirable to require 
implementers to support a few rudimentary schemas, with 
strong preferences to existing standard messages formats 
(e.g. as used in SIP or CCXKL) . In essence, the architecture 
allows both "he platform and application developers to take 
the full advantage of the standardized extensibility of XML 
or similar markup to introduce other features without losing 
interoperability in the mean time. 

Example X: use of smex as a logging object 
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<smes id-"logServer' r > 

<param nanie===''d: server" xmlns : d-" urn : Microsoft . com/COM"> 
<d : prot oca 1 >DC(M< /d: protocol> 

<d:clsid>2093093029302029320942093432098</d:cisid> 
<d: iid>09038 59304 9034 9853098 5309094 303</d:iid> 

</param> 
</smex> 

<listen ...> 

...// other directives binding reco results to input 
fields 

<bind targetEIeraent^logServer" targetAttribute»"sent" 
value-"* [Slog $ge$ 3] "/> 

</listen> 

This example demonstrates how a logging mechanism can be 
achieved using a COM object with its class id and interface 
id. The speech developers attach an attribute "log" 
indicating the level of interests for logging to the 
relevant SML nodes. In -he example above, the app developer 
chooses to log all nodes with log value greater or equal to 
3 by using a single bind directive. The example works in 
both downlevel and uplevel browsers. 

The example also intends to demonstrate it is possible for a 
page to contain nultxple smex objects communicated with the 
same platform conpenent as long as there won't be confusion 
on which smex object is responsible for delivering the 
platform messages back to the recognition document. The 
above example implies a component can implement multiple 
interfaces, each of which has its own sme>; or message 
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conduit. The same argument applies to TCP servers listening 
to multiple ports. 

Example 2; Reading the addresses for an incoming call: 

<input type="text ,jr ±d-"remote"/> 
<input type="text" id~" transfer "/> 
<input type^"text" id-"iocal"/> 
<input type^"hidden" id-"session_jid"/> 

<smex id-''telephone" sent-"start_lls Lening''> 

<param narae="server">http://tel-3vr/whatever</param> 
<tainc targetElement="sessaon_id' / value==' r //sid r V> 
<bind target El ement-Temote" value»"//remote_addr'V> 
<bind target£lement-="transf er" 

value^'V /trans fer_addr"/> 

<bind targetBlament^local" valuer" //local__addr"/> 

</smex> 

This example shows how the bind directives can be used to 
process the received message* The example assumes a message 
fox incoming call to have sub-elements remot e__addr, 
transf er_addr, and local_addr whose contents represent the, 
remote, transfer, and local addresses of the incoming call, 
respectively. 

In this example, a connectionless protocol based on HTTP is 
used to communicate with the telephony server. The telephony 
server here is designed to coiranunica te with more than one 
browser instances, and hence each client must identify 
itself with a unique id assigned by the server when the app 
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starts. This is achieved in this example by sending the 
server a "start^listening" message- In this example, the 
session id is stored in a hidden field that can be sent back 
to the Web server and passed on to the next page of the 
application, although other techniques (e.g. client side 
cookie) can also be used to manage session state- As is in 
the case for reco, not every bind directives will be 
executed for every platform message. The above example does 
not irnply the unique id will only be received when there is 
an incoming phone call. 

7 . 1 Pxopax ties 

The smex object can have the following properties, with only 
the Read/Write properties allowed to also serve as 
attributes for the initial value specification. 

* sent: Read/Write, a string corresponding to the message 
to be sent to the platform component. Whenever sent is 
used as a lvalue, its contents are dispatched. There is 
no effect when the property is used as a rvalue or when 
a null object is assigned to this property. 

* received: Read Only, an XML DOM Nude data indicating 
the received message. The ne.ssage will be available as 
b rvalue until the next onReceive event is ready to 
send. 4 

* -timer: Read/Write, a number in millisecond indicating 
the time span before a timscut event will be triggered. 
The clock starts ticking when the property is assigned 

a posi-ive value. The value can be changed when a count . 
down is in progress. A zero or negative value stops -he 
clock without triggering the zimeout event. The default 
is 0, meaning no timeout. 
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• status: Read Only, an integer indicating the* recent 
status of the object- The possible values are 0, -1, 
and ~2, which means normal, timeout expired, and 
communication with the platform cannot be established 
or has been interrupted, respectively. Platform 
specific error messages should conveyed through the 
.received property. For the cases that the error message 
is successfully delivered, ™he status code is 0. 

7.2 Events 

The object has the following events; 

• onReceive: This event is sent when a platform message 
has arrived- If there are any directives declared by 
the bind elements, those directives will first be 
evaluated before the event is fired. Prior to the 
sending, the received property will be updated. 

• onError: This event is sent when the timeout, expires, 
or a communication link error has been encountered, 
ifthen the event is sent, the status property will be 
updated with a corresponding error code as described 
above - 

When assuming an element form, smex may have the following 
child elements: 

• bind: same as in reco r except the directives are 
operated on the received message, 

• papain: same as in reco r provides platform specific 
parameters for the smex object. Each param element may 
be named using a "name" attribute, with the contents of 
the param element being the value of the parameter. In 
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one embodiment, the element should understand standard 
XML attributes for name apace and XML data type 
declaration . 



7.4 Pth&x- Cox&m&nizs 

One elegant way to extend SMEX for the logging function 
would be 

<smex id="log5erver" »> ... </smex> 

<script> function iogHessage (logClass , message) { 

logServex. sent « logClass 4- * | " + message; 
} </script> 

This in effect extends the object with a (global) # 
function whose behavior can be customized. In the 
above example, the logging function is programmed to 
insert: a field separator " \" between the id and the 
message . 

For those who do not like global functions, they can 
use the "protot ype" property of BCMAScript to attach 
the function as an object method. For example 

<smex id«"logServer" onload= "addFunct ion ( ) "> ... </smex> 
<script> 

function my_logMessage ( logClass , message) { 

logServer- sent - logClass + * i " + message; 

} 

function adcFunction ( ) { 
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logServer .prototype . logMessage ~ 
my__logMessage ? 
) 

</script> 

One can refer to the function in a more objected- 
oriented manner: 

logServer. logMessage (R3COJLOGJERR0R, "My 
message") ; 

It is noted that more work is required from the sitiex 
object implementers in order to make the extension work 
as in the above example, although all the necessary 
mechanisms are already well established standards. 
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4, Brief Description of the Drawings 



FIG. I is a plan view of a first embodiment of a 
computing device operating environment* 

FIG. 2 is a block diagram o£ Lhe computing device of 
FIG . 1. 

FIG, 3 is a plan view of a telephone* 

FIG. 4 is a block diagram of a general purpose computer. 

FIG. 5 is a block diagram of an architecture for a 
client/server system, 

FIG. 6 is a display for obtaining credit card 
information . 

FIG . / is a page of mark-up language executable on a 
client * 

FIG. 8 is an exrernpl a ry p^ge of mark-up language 
executable on a client having a display and voice recognition 
capabilities . 

FIGS. 9A and arc an exemplary page of mark-up 

language executable on a client with audible rendering only 
and system initiative. 

FIG. 10A and 10B are an exemplary page of mark-up 
language executable on a client with audible rendering only 
and mined initiative. 

FIG. 11 is an exemplary scripz executable by a server 
side plug-in module. 

FIG. 12 is a pictorial illustration of a first 
operational mode of a recognition server, 

FIG. 13 is a pictorial illustration o£ a second 
operational mode of the recognition server. 

FIG. 14 is a pictor.ial illustration of a third 
operational mode of the recognition server, 

FIGS. 15A and 155 are an exemplary page of declarative 
marfc-up language executable on a client without scripting. 
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290' 



272 



<htm\> 

<form rd^'get^c&rd^ittfo" & ethod= "post* actions "http:/fpay m t?nt.a sp' > 

<seicct *ame«*eard.„tYi>** onCMck** 'talk(g_ca rd^fcy pes)' > 

<opt*o& va|^- h amc.)t'>Am€n<art Expre*s</optlort> 
< option vaiue=* "visa" > Vlsa<J<>pt\on> 
<optfort value* t *ms n > MasterCard < /op tton> 
</seiect> 

<reco Jd- ,r g_<:ard_types 1, on&ecp ^haitdleC) " :> 

< grammar src ^ V^ra m #c subtype* 1 * /> 
</reco > 

<ioput type** "text* nam e= "'card^.riu m * w?dth = "30"* 
on crick = *taifk<;ii_car«»_rj«m)* /> 
<reco " g^card^rtum * onReco « °hatidie() w > 

< grumrnar src » M -V 9 r * m #dlgits - /> 
</reco > __ 

-cfnput type- a t**f namc= 'fexpfry^da te" — -■ ■ 

on Click »* ,, ta^k(g_exp^ry_d»t«^} ,, /> 
<reco id- "g^expiry^dafce h > 

< grammar src - + * ./gram #dates * /> 

<ifrput typ«= Submit" valu e « "8ut>ra it* onCticK= *v<*rifyO" /> 



-280 



282 



— 284 



</form> 
<scnpi;;> <rf [COATAf 

function fc<alk{gobj) { 
gobj . activate t) ; 

\ 

function handle O ( 

if (cret__caxd__in£o-caird_nLirfj ■ = null} I 

if {get_cajrd_i.nfo . card^type „ value -*» ") { 

if ( get_e ax d_ijj£o.ca;rd_n urn* length \*~ 15) 



) else 



alert ("amex should have 15 digits *}i 



if {ge t^card^inf o . eardrum .length I** 16} 
alert ("visa and master should have 1 6 digits'*); 



function verifyO i 

v&r flag - window , confirm ( "sufami t the credit card 
in£o? H ) ; 

if (flag) f 

go t^ca rd^info .submit | ) j 



J 



</scrlpt> 
</htmJ> 
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< b ?(orm Id o*t_card^rtfo - method --p«f acdoo htt9://PW«k- a «p * 

< prompt W--P-W«lo«.^ n*^y6»r credit card </ prompt > 

< prompt Id P «»n»ble *> I did»*t vndmtaad you </ prompt > 

300 < prompt Id --p .c*rd„ t yp* " -■•"•"» What credit card *roy>d you 

< prompt* p.«plrr-4.te * b.r 0 «4. Whirt IP ft. 
date? promp t > 

< prompt Itf ^" p„eor*t:#rtt *> . j „ „ . — * 

^ . rv»»«/> < vaiue setect^" eardrum /> 



— < prompt to ^_*-w*r*,^it* - 

303 -*^^ xh^tyaur « value *«*ect card^typ* M /> < value 

wFfch ssxplratkm date 

< value seiecc- k explry^dot* " /> 
</ prompt > 

< prompt id =- p„ccnfirn) "> Is th\* correct? </ prompts 
<reco id-" a «f«Ijyp« " ooWoR*ei» «, urn bi«< fch**, a > - 

Sec -- y "haM«WftWft«rt.typ«) / 

< grammar sre « * *tcard_tyr>e* 

</r«co > . onNo ft eC G mum tjte(this, 1 ) "> 

c rcco id ™ s„card_nwm . 
oaRcco cartf_rium) ;> 

< grammar sre** ,/«iraim#dlglt5 " /> 

*rJcS*W-" 9 expired*!* " «w»WoR«eo rrtumW-Ctht*, 1) ** 

onReco ~ha«tfle{thl*, J%*Z r * a ,?> " " " " " 

< ^rammsr pre-* V tr ram # date* f> 

</reco J> & »R*eo confirm *d<thii) " /> 

< reCO W ™ confirmation 

«= 9r 9 mmer sre * * ram #ir**n* " /> 

/select name «" carrt_typ* ? 

< option value --a«ie* "> Aw eric*« expr*** </ option > 

< option vafce visa "> Vi*» </ option >■ 

< option va*oe ^iflS"> MasterCard </ option > 

< f select > 

< fnpui type ■*texf pame card__num - vrtfth -'30 • /> 
- input type text* r*am e - ' h *5£p*ry_ <J »te • /> 

< input type ^ submit * value Submit " f> 
</ form > 

< script > < f[COATA| 

repeat 3 Or 
checkfilledt) / 

1 function fr.ouable Igiob^, ^pr^P^J { 

P n mnb 1 c - active U ; 

i 
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function .JiandleO { 
handieO: 

* function checkFflledC) { 

if (cardLtype .value *« *"") i 
p_card_type » act** veO ; 
g^cartLtypes .activateO ; return; 

> 

if (eardrum, value —""J { ^ «. . 

p„card_nym, active CD : ^carcLnum.accI vateO? return, 

^ if <expiry_date. value — "*) { „ 

p^xpiryjdara.sctiveOf g«expfry_jdate.actwat*0; return; 

p_content* activate O ; 
p^conf i rm . a eti va t e O i 
conf 7 nnati on . act? vate C) ; 

> 

function conf i rmed(gobj) { 

-if Cgobi.recogfces.Text yes ) 
gat^card^info* submit O i 

// user codes start here 
function handle O f 

if <get_card_info.card_type «- "amejO { 

if Cget-xardj ofo ^ .car cLnum. length t- lO f 

prompt .speak Came* sKouJcJkve IS digits >r 
Qet_ca.rd„irtfo.card_ntim - ; 

J 

i j>i 

if C^t_card_info,card_nufn. length J« 16) { 

prompt. speak C r visa and master should have 16 digits }, 
get^card^nfo.carcLnum = ""; 

</scnpt> 
</body> 
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<:b ° C ^form id^oeO^rd^nfo" method -"post" ac*ioi*=*http://paymentasp'' 

<proropt td^p^weicomeVWe now need your credit card</prompt> 

<pmmpt ttN*p_mumbIe rt >2 didn't understand you</pronipt> 

<protnpt ld^p_card„type* ba^ein-^tme^What credit cart! would you 

<promptid- w p_eo£piry_dafce w bargein-true^What the explrabort 
date?</f>rampt> 

< prompt i d- n p_content"> . , 
I have your <^lue select^ h C3rd_typeV><va t«e select^ card^num /> 

with expiration date <value sdec±=*expiry„date w /> 

</prompt> 

<prompt W*>_confirm h >ls this correct? </promp t> : 
<reco ^^et^card^info" onReco^handtep* onN 0 R*co ^rmimbl e (this> > 

^grammar src ^Vsram/getPayrnefir /> 

<bind target ="card„type" value^/cord/typftV^ 

<blnd target^*card_num" valu^V^rd/rtumoer /> 
405 <bmd taroet^«ixpiry w date" vatiie=V<saird/expr_date /> 

<r^°^"conf1rmatio«" onReco ^connrmedtthte) onNoReco««mumbieCthi*r /> 

<grammar src Vsram#yeisnoV> 

SfiSect na me= "card Jtype*> 
■ <optton value« h ame3r>A«iencan Exprcs»</opbon> 
^option value^ n vi5a w >Vtsa<^p«on> 
«r option value^ms">WasterCard</apt>on> 

<frt^rt^pe==nexr name* H card_Rum n - widths "30" /> 
<fnput type-text" name^ t, expiry_date w /> 

< input tYpe^sabrrplt'' value = "Submit* 1 /> 
</fbrm> 

<SCTfpt><![CDATAf r ^ 
Function weTcomeO C 
P_we1 come . active O ; 
repeat ** 0: 
checkFilleoO; 

function murnbleCgobj) { 
gobj,deactivateQ; 
p_mun)bl e , acrcweO ; 



p_mun)bl e , a. cm veO ; 

checkFinedO; /^7>£ 1DA 

function ^handleO £ 7 ' XC7 * JUrT 

bandleOj 
checkFiHedQ; 
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function checkFitledO t 

lf ^plc^^^e.acrivsQ;; do_field>activateO; return; 

If aMrym^^ 0 i d0 _ fl - eld . activateO; retHrn: 

?f t^J^^J^oS do„field.activ a teO : return; 

jLcontent. activateQ; 
puconf 1 n». activate O ; 
confirmatf on.actlvateC) ; 

lunction confirmed tgobjtf { - 
if <*gobj,recogftes v text ~~ yes 

get_carcLi nf o . subnn t Cgensntl O} » 

^/ us&r codes start here 
// 

function handleO { , . J . f 
if Cfie"M «« getL.carOnfo*card.num> i „^ . 
if Cset-jcarcL,info»card^num. length 15> \ 

prcwt.spSak Can** should Tiave 15 dibits*) r 

get_card w info. eardrum « ; 

* e if e C9etucarct.info.cardLjiumJength i~ 1<0 JC 

prompt-speak C'Visa should have IS digits ); 
get^cardlinfo. card_nuro « 



function gensjnlQ -f 

$xr = *<sral ><c red it^ca rd type** t £ 



str card^type. value; a£tr +~ ;><nuisber> ; , 
str cardl>iumber, value; str '</nurAber><ext>ire> ; 
str expiry^date. value; str +=t 
■ </exjji r e^/ered1tL.card></sro7>" ; 



return str; 
> 

]}> 

</script> 
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SAKPLE £-SP+ PAGE 

<%6 Page language-script:^ Aut*^t*ir«p--talJe- Inherits ^C^dl^Trarxsecticn" %> 

O --^P^page for both voice-only fi auItiitfWUl credit card txanple — > 
<zscript> 

function handle {) { 

if (field =*** get_C5trd_in.f o - card_nujn) { 

if (got card info, car djnunu length !» 15) I 
prompt", speaV ("ai&e* should iiave IS digits**); 
get_card_info.card_nurn = M "; 

) 

* e ££ & fget card_inf o . card_nuwi- length ! 53 16) { 

prompt! sp^ak i w vjlsa Should ftave 3-5 digits") ? 
get ca rd__i nf o . C3 rd_num ** H ? 

) 

function gensml O I 

etr - '<£rcl><credit card><card_i;ype> ; - 

str card type. value; stt +« *</c«Lrd_.c>T>e><nunfl>ex> ; 

str card""n««ibec.^lue; str +- '</nu»b»*><«pire> ' , 

str +« explicate, value; str +- »</«Kpir«></cr«Jlt_caHto-</«i> , 

return str; 

1 

</script> 

<script 3cunat**5erver**> 
function Page^koad <oi>j, args) I 

if ™Hdator ='n«w SyM- -Speech. SMLValidaCor E» ./CreditSDL . »1 "} ; 
dsml « validator. Evaluate <args) ; 
Navigate (ChoosePage (dsml) 3 f 

] else { 

// initialize fields with args 

} 

} 

</script> 
</Uead> 

<sptecb-.form 4d=«gat_card_.iAfo« ****** J*™" 

gzaMraar-'-./gcamlcard^tSfipes" onFhrase*i^$h- handle () 
<option>American Express</opt±on> 
<optionWi sac /options 
<option>^a^tercard </option> 

</ s pee ch: choi ce> ^^^ptM,* sav the nunibei:" 

<gpeechitexT:bo* najne-"C4rd_nttiitoe*" prompt- Pleas* s ay the nui^ 
frranimar-^./aram+digits" oriFliras&FxnjLSh^ handle U > 

gramffla^./gcwWates" onPbrase Efiftlsh- "handle () /> 



</speech : form> 
</bt>dy> 
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<html> 
<boc3y> 

■-?<^Q <l~ the daLa section > 
^ <forra Id="get drink" > 
\j <input narae^ "drink" f> 
& <input typc= "radio* name** "c ream *7> 
<input type^radio" naine*=" sugar "/> 
<inpub name-"u3d" type-"hidden r V> 
</form> 

•?C^ <j — xhe speech section — > 

\ <uroir:pt id«*welcome">welcome, caller I </prompt> 
^ <prompt id= rf ask"> Do you want coke, coffee, or orange juice? 
</pronip r -> 

< F rompt id-"con£irsn"> I hfiard <vaiue href-"dr Lnk'V> - Is ^his 

correct? < /prompt > 
<proinpt id-"tnanks"> Thank you. Please wait wh«n J get it for 

you, </prompt> 
<p r auipfc id-"retry"> Okay, let's do this again </prompt> 
<prompL j.d-"re^rompt rr > Sorry, 1 missed that. < /prompt > 
^prompt id^"cream_^ttg<ir /r >Oo you want cream or sugar with your 

coffees ?< /prorop t > 

<reco id **" reco^dr Lnk' r ;><gra:mnar sec-" « /drink Lypes'V> 
Jfi^f — - <bind tes~~V ^confidence £le$ 3.0]" 

* v 7<*^ ■ t./argetElement- r/ reprompt " tarcetMethod==''start" 

J targetEle3ient- / 'ask" targetMethod- w start" 

Sit — targetElement-"reco_ririnJc" targetNethod-"o tarz"/> 
cbind ndst-Vdrink/cof teeteconlidencR $gt$ 1CJ" 

111, targetBloitent*'' drink" value**'' /drink" 

jj7Y - target£lement-"cream_suqar'' targ©tMethod«*stdcf 

^?CV — largetElement»"reco M cream_3ugar*' 
targetMeinod==*'startV> 
<bind tests'"/ (^confidence Sgt$ 10}" 
, ? y£ — — targetElemer-t^'Mrlnk" value** "/drink" 

targetEleznont-'' confirm" targetMethod-^tart" 
>^ tftTgetElement» / 'reco_ycano- target>isthod--s-arL'V> 

<:/reco> 
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<reco id=- ,/ reco_crca?ri_stjgar"><grarnrr : ar src-" . /creajiH- sugar "/> 
<bir.d test~"/ [iconf idence $gt$ 10 and 
^^-6? host C) /get^drinfc/drink = * coffee' j " 

> target£lem£->n!,«=' / cr<sara / ' targetAttribute^'checfced" 

value sB,7 /cream/@valae / ' 
targetElement^s'jgaJC" targe tAttribute="check^d" 

value="/sugar/gvalue" 
targetfclement^confirai*' targetMethod=''start'' 
target EIXement="rRCo_yesno" targetMethod=="start"/> 

<:/reco> 

<reco id«"reco m yosno"> <gxammar src=" , /yesno'V> 
^C"^5 <bind test~"/yes [SconlTtdence SylS 103" 

' v - targets «mant« -r thariks* tar get Wethod-*"s tart" 

X ^ — taL-getBlement-''get_drink'' 

t a rgGtMetinod==" submit'"'" /> 
<bind t<*st="/no or . / [ @conf xdence 10]" 

gtjc> - targatElexent^"ratry" LargetHethod~"start" 

: — targeLElexent-"ask" - argetMethod--""' start " 
^ytv targetEle;ment^''xcco_drink: r ' 

</reco> 

<!~call control section, — > 

<smex J d=" telephone" sent=''start_li3tcning"><p3rarii 
setver^^'ccxmlproc"? ... </param> 
lls u <:foind targetElemenc~''uid"' va 1 ue="/euid"/> 

\ <foind tsst~VCan_f:ormected' r 

> j-fct _ targ«tElement^ ftr w©lcorae" targetMcthod--"start" 

■jt,l^ - . targetElernent-"ask r ' t3rgetKethod«"start" 

7(^3 , tarqet£lentent~"reco drink" 

targetMethod=''start"/> 

</snex> 
</Jx>dy> 
</htm]> 

FIG, 15B 

A markup language for execution on a client 
device in a client/server system includes extensions for 
recogni t ion . 



2 . Represent at i ye Dr awi 



